Preface


Q: What’s the difference between an argument and a proof? A: An argu- 
ment will convince a reasonable person, but a proof is needed to convince 
an unreasonable one. 

Anon. 


Die Mathematiker sind eine Art Franzosen: Redet man zu ihnen, so 
übersetzen sie es in ihre Sprache, und dann ist es alsbald ganz etwas
anderes. (Mathematicians are like Frenchmen: whatever you say to them 
they translate into their own language and forthwith it is something 
entirely different.) 

Johann Wolfgang von Goethe 

This book offers a concise overview of some of the main topics in differential 
geometry and topology and is suitable for upper-level undergraduates and begin- 
ning graduate students in mathematics and the sciences. It evolved from a set of 
lecture notes on these topics given to senior-year students in physics based on the 
marvelous little book by Flanders [25], whose stylistic and substantive imprint 
can be recognized throughout. The other primary sources used are listed in the 
references. 

By intent the book is akin to a whirlwind tour of many mathematical countries, 
passing many treasures along the way and only stopping to admire a few in detail. 
Like any good tour, it supplies all the essentials needed for individual exploration 
after the tour is over. But, unlike many tours, it also provides language instruction. 
Not surprisingly, most books on differential geometry are written by mathemati- 
cians. This one is written by a mathematically inclined physicist, one who has 
lived and worked on both sides of the linguistic and formalistic divide that often 
separates pure and applied mathematics. It is this language barrier that often causes 
the beginner so much trouble when approaching the subject for the first time.
Consequently, the book has been written with a conscious attempt to explain as much
as possible from both a “high brow” and a “low brow” viewpoint,¹ particularly in
the early chapters.

For many mathematicians, mathematics is the art of avoiding computation. Sim- 
ilarly, physicists will often say that you should never begin a computation unless 
you know what the answer will be. This may be so, but, more often than not, 
what happens is that a person works out the answer by ugly computation, and then 
reworks and publishes the answer in a way that hides all the gory details and makes 
it seem as though he or she knew the answer all along from pure abstract thought. 
Still, it is true that there are times when an answer can be obtained much more eas- 
ily by means of a powerful abstract tool. For this reason, both approaches are given 
their due here. The result is a compromise between highly theoretical approaches 
and concrete calculational tools. 

This compromise is evident in the use of proofs throughout the book. The one 
thing that unites mathematicians and scientists is the desire to know, not just what 
is true, but why it is true. For this reason, the book contains both proofs and com- 
putations. But, in the spirit of the above quotation, arguments sometimes substitute 
for formal proofs and many long, tedious proofs have been omitted to promote the 
flow of the exposition. The book therefore risks being not mathematically rigorous 
enough for some readers and too much so for others, but its virtue is that it is
neither encyclopedic nor overly pedantic. It is my hope that the presentation will appeal
to readers of all backgrounds and interests. 

The pace of this work is quick, hitting only the highlights. Although the writing 
is deliberately terse, the tone of the book is for the most part informal, so as to
facilitate its use for self-study. Exercises are liberally sprinkled throughout the text 
and sometimes referred to in later sections; additional problems are placed at the 
end of most chapters.² Although it is not necessary to do all of them, it is certainly
advisable to do some; in any case you should read them all, as they provide flesh for 
the bare bones. After working through this book a student should have acquired all 
the tools needed to use these concepts in scientific applications. Of course, many 
topics are omitted and every major topic treated here has many books devoted to 
it alone. Students wishing to fill in the gaps with more detailed investigations are 
encouraged to seek out some of the many fine works in the reference list at the end 
of the book. 


1 The playful epithets are an allusion, of course, to modern humans (abstract thinkers) and Neanderthals (concrete 
thinkers). 

2 A solutions manual is available to instructors at www.cambridge.org/9781107042193.
The prerequisites for this book are solid first courses in linear algebra,
multivariable calculus, and differential equations. Some exposure to point set topology
and modern algebra would be nice, but it is not necessary. To help bring stu- 
dents up to speed and to avoid the necessity of looking elsewhere for certain 
definitions, a mathematics primer is included as Appendix A. Also, the begin- 
ning chapter contains all the linear algebra facts employed elsewhere in the book, 
including a discussion of the correct placement and use of indices.³ This is
followed by a chapter on tensors and multilinear algebra in preparation for the study
of tensor analysis and differential forms on smooth manifolds. The de Rham coho- 
mology leads naturally into the topology of smooth manifolds, and from there to a 
rather brief chapter on the homology of continuous manifolds. The tools introduced 
there provide a nice way to understand integration on manifolds and, in particular, 
Stokes’ theorem, which is afforded two kinds of treatment. Next we consider vector 
bundles, connections, and covariant derivatives and then manifolds with metrics. 
The last chapter offers a very brief introduction to degree theory and some of its 
uses. This is followed by several appendices providing background material, cal- 
culations too long for the main body of the work, or else further applications of the 
theory. 

Originally the book was intended to serve as the basis for a rapid, one-quarter 
introduction to these topics. But inevitably, as with many such projects, it began 
to suffer from mission creep, so that covering all the material in ten weeks would 
probably be a bad idea. Instructors laboring under a short deadline can, of course, 
simply choose to omit some topics. For example, to get to integration more quickly 
one could skip Chapter 5 altogether, then discuss only version two of Stokes’ the- 
orem. Instructors having the luxury of a semester system should be able to cover 
everything. Starred sections can be (or perhaps ought to be) skimmed or omitted 
on a first reading. 

This is an expository work drawing freely from many different sources (most 
of which are listed in the references section), so none of the material har- 
bors any pretense of originality. It is heavily influenced by lectures of Bott and 
Chern, whose classes I was fortunate enough to take. It also owes a debt to the 
expository work of many other writers, whose contributions are hereby acknowl- 
edged. My sincere apologies to anyone I may have inadvertently missed in my 
attributions. The manuscript itself was originally typeset using Knuth’s
astonishingly versatile TeX program (and its offspring, LaTeX), and the figures were made
using Timothy Van Zandt’s wonderful graphics tool pstricks, enhanced by the 
three-dimensional drawing packages pst-3dplot and pst-solides3d. 

3 Stephen Hawking jokes in the introduction to his Brief History of Time that the publisher warned him his 
sales would be halved if he included even one equation. Analogously, sales of this book may be halved by the 
presence of indices, as many pure mathematicians will do anything to avoid them. 
It goes without saying that all writers owe a debt to their teachers. In my case I
am fortunate to have learned much from Abhay Ashtekar, Raoul Bott, Shing Shen 
Chern, Stanley Deser, Doug Eardley, Chris Isham, Karel Kuchar, Robert Lazars- 
feld, Rainer Kurt Sachs, Ted Shifrin, Lee Smolin, and Philip Yasskin, who naturally 
bear all responsibility for any errors contained herein ... I also owe a special debt 
to Laurens Gunnarsen for having encouraged me to take Chern’s class when we 
were students together at Berkeley and for other very helpful advice. I am grateful 
to Rick Wilson and the Mathematics Department at the California Institute of Tech- 
nology for their kind hospitality over the course of many years and for providing 
such a stimulating research environment in which to nourish my other life in com- 
binatorics. Special thanks go to Nicholas Gibbons, Lindsay Barnes, and Jessica 
Murphy at Cambridge University Press, for their support and guidance through- 
out the course of this project, and to Susan Parkinson, whose remarkable editing 
skills resulted in many substantial improvements to the book. Most importantly, 
this work would not exist without the love and affection of my wife, Alison, and 
our sons David and Michael. 
Linear algebra


Eighty percent of mathematics is linear algebra. 

Raoul Bott 

This chapter offers a rapid review of some of the essential concepts of linear algebra 
that are used in the rest of the book. Even if you had a good course in linear algebra, 
you are encouraged to skim the chapter to make sure all the concepts and notations 
are familiar, then revisit it as needed. 


1.1 Vector spaces 

The standard example of a vector space is $\mathbb{R}^n$, which is the Cartesian product of $\mathbb{R}$
with itself n times: $\mathbb{R}^n = \mathbb{R} \times \cdots \times \mathbb{R}$. A vector v in $\mathbb{R}^n$ is an n-tuple
$(a_1, a_2, \dots, a_n)$ of real numbers with scalar multiplication and vector addition
defined as follows:

$$c(a_1, a_2, \dots, a_n) := (ca_1, ca_2, \dots, ca_n) \tag{1.1}$$

and

$$(a_1, a_2, \dots, a_n) + (b_1, b_2, \dots, b_n) := (a_1 + b_1, a_2 + b_2, \dots, a_n + b_n). \tag{1.2}$$

The zero vector is (0, 0, ... , 0).

More generally, a vector space V over a field $\mathbb{F}$ is a set {u, v, w, . . . } of objects
called vectors, together with a set {a, b, c, . . .} of elements in $\mathbb{F}$ called scalars,
that is closed under the taking of linear combinations:

$$u, v \in V \ \text{and} \ a, b \in \mathbb{F} \ \Rightarrow \ au + bv \in V, \tag{1.3}$$

and where $0v = 0$ and $1v = v$. (For the full definition, see Appendix A.)


The notation A := B means that A is defined by B. 
A subspace of V is a subset of V that is also a vector space. An affine subspace
of V is a translate of a subspace of V.¹ The vector space V is the direct sum of
two subspaces U and W, written $V = U \oplus W$, if $U \cap W = 0$ (the only vector
in common is the zero vector) and every vector $v \in V$ can be written uniquely as
$v = u + w$ for some $u \in U$ and $w \in W$.

A set $\{v_i\}$ of vectors² is linearly independent (over the field $\mathbb{F}$) if, for any
collection of scalars $\{c_i\} \subset \mathbb{F}$,

$$\sum_i c_i v_i = 0 \quad \text{implies} \quad c_i = 0 \ \text{for all } i. \tag{1.4}$$


Essentially this means that no member of a set of linearly independent vectors may 
be expressed as a linear combination of the others. 
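
As an informal numerical companion to definition (1.4), the following sketch tests a finite list of vectors with rational entries for linear independence by row reduction. The function names `rank` and `linearly_independent` and the sample vectors are illustrative choices, not notation from the text, and a computation of this kind checks particular cases rather than proving anything.

```python
from fractions import Fraction

def rank(vectors):
    """Rank of a nonempty list of vectors, by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in v] for v in vectors]
    r = 0  # number of pivot rows found so far
    for col in range(len(rows[0])):
        # look for a row at or below position r with a nonzero entry in this column
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        # clear this column in every other row
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
        if r == len(rows):
            break
    return r

def linearly_independent(vectors):
    """True when sum_i c_i v_i = 0 forces every c_i = 0, i.e. the rank equals the count."""
    return rank(vectors) == len(vectors)

# Illustrative vectors in R^3: the third is the sum of the first two.
independent = linearly_independent([(1, 0, 2), (0, 1, 1)])
dependent = linearly_independent([(1, 0, 2), (0, 1, 1), (1, 1, 3)])
```

Exact rational arithmetic is used so that the test for a zero pivot is not affected by rounding.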

EXERCISE 1.1 Prove that the vectors (1, 1) and (2, 1) in $\mathbb{R}^2$ are linearly
independent over $\mathbb{R}$ whereas the vectors (1, 1) and (2, 2) in $\mathbb{R}^2$ are linearly dependent
over $\mathbb{R}$.

A set B of vectors is a spanning set for V (or, more simply, spans V) if every 
vector in V can be written as a linear combination of vectors from B. A spanning 
set of linearly independent vectors is called a basis for the vector space. The cardi- 
nality of a basis for V is called the dimension of the space, written dim V. Vector 
spaces have many different bases, and they all have the same cardinality. (For the 
most part we consider only finite dimensional vector spaces.) 

Example 1.1 The vector space $\mathbb{R}^n$ is n-dimensional over $\mathbb{R}$. The standard basis is
the set of n vectors $\{e_1 = (1, 0, \dots, 0),\ e_2 = (0, 1, \dots, 0),\ \dots,\ e_n = (0, 0, \dots, 1)\}$.

Pick a basis $\{e_i\}$ for the vector space V. By definition we may write

$$v = \sum_i v_i e_i \tag{1.5}$$

for any vector $v \in V$, where the $v_i$ are elements of the field $\mathbb{F}$ and are called the
components of v with respect to the basis $\{e_i\}$. We get a different set of components
for the same vector v depending upon the basis we choose.

EXERCISE 1.2 Show that the components of a vector are unique, that is, if
$v = \sum_i v_i e_i = \sum_i v'_i e_i$ then $v_i = v'_i$.

EXERCISE 1.3 Show that $\dim(U \oplus W) = \dim U + \dim W$.

1 By definition, W is a translate of U if, for some fixed $v \in V$ with $v \neq 0$, $W = \{u + v : u \in U\}$. An affine
subspace is like a subspace without the zero vector.

2 To avoid cluttering the formulae, the index range will often be left unspecified. In some cases this is because 
the range is arbitrary, while in other cases it is because the range is obvious. 
EXERCISE 1.4 Let W be a subspace of V. Show that we can always complete a
basis of W to obtain one for V. In other words, if $\dim W = m$ and $\dim V = n$,
and if $\{f_1, \dots, f_m\}$ is a basis for W, show there exists a basis for V of the form
$\{f_1, \dots, f_m, g_1, \dots, g_{n-m}\}$. Equivalently, show that a basis of a finite-dimensional
vector space is just a maximal set of linearly independent vectors.


1.2 Linear maps 

Let V and W be vector spaces. A map $T : V \to W$ is linear (or a homomorphism)
if, for $v_1, v_2 \in V$ and $a_1, a_2 \in \mathbb{F}$,

$$T(a_1 v_1 + a_2 v_2) = a_1 T v_1 + a_2 T v_2. \tag{1.6}$$

We will write either T(v) or Tv for the action of the linear map T on a vector v.

EXERCISE 1.5 Show that two linear maps that agree on a basis agree everywhere. 

Given a finite subset $U := \{u_1, u_2, \dots\}$ of vectors in V, any map $T : U \to W$
induces a linear map $T : V \to W$ according to the rule $T(\sum_i a_i u_i) := \sum_i a_i T u_i$.
The original map is said to have been extended by linearity.

The set of all $v \in V$ such that $Tv = 0$ is called the kernel (or null space) of T,
written ker T; dim ker T is sometimes called the nullity of T. The set of all $w \in W$
for which there exists a $v \in V$ with $Tv = w$ is called the image (or range) of T,
written im T. The rank of T, rk T, is defined as dim im T.

EXERCISE 1.6 Show that ker T is a subspace of V and im T is a subspace of W. 

EXERCISE 1.7 Show that T is injective if and only if the kernel of T consists of the 
zero vector alone. 

If T is bijective it is called an isomorphism, in which case V and W are said to
be isomorphic; this is written as $V \cong W$ or, sloppily, $V = W$. Isomorphic vector
spaces are not necessarily identical, but they behave as if they were.

Theorem 1.1 All finite-dimensional vector spaces of the same dimension are 
isomorphic. 

EXERCISE 1.8 Prove Theorem 1.1. 

A linear map from a vector space to itself is called an endomorphism, and if it is 
a bijection it is called an automorphism. 


3 Physicists tend to call an endomorphism a linear operator.
EXERCISE 1.9 A linear map T is idempotent if $T^2 = T$. An idempotent
endomorphism $\pi : V \to V$ is called a projection (operator). Remark: This is not to
be confused with an orthogonal projection, which requires an inner product for its
definition.

(a) Show that $V = \operatorname{im} \pi \oplus \ker \pi$.

(b) Suppose W is a subspace of V. Show that there exists a projection operator
$\pi : V \to V$ that restricts to the identity map on W. (Note that the projection
operator is not unique.) Hint: Complete a basis of W so that it becomes a basis
of V.

EXERCISE 1.10 Show that if $T : V \to V$ is an automorphism then the inverse map
$T^{-1}$ is also linear.

EXERCISE 1.11 Show that the set Aut V of all automorphisms of V is a group. (For 
information about groups, consult Appendix A.) 


1.3 Exact sequences 

Suppose that you are given a sequence of vector spaces $V_i$ and linear maps
$\varphi_i : V_i \to V_{i+1}$ connecting them, as illustrated below:

$$\cdots \longrightarrow V_{i-1} \xrightarrow{\ \varphi_{i-1}\ } V_i \xrightarrow{\ \varphi_i\ } V_{i+1} \longrightarrow \cdots$$

The maps are said to be exact at $V_i$ if $\operatorname{im} \varphi_{i-1} = \ker \varphi_i$, i.e., the image of $\varphi_{i-1}$
equals the kernel of $\varphi_i$. The sequence is called an exact sequence if the maps are
exact at $V_i$ for all i. Exact sequences of vector spaces show up everywhere and
satisfy some particularly nice properties, so it is worth exploring them a bit.

If $V_1$, $V_2$, and $V_3$ are three vector spaces, and if the sequence

$$0 \xrightarrow{\ \varphi_0\ } V_1 \xrightarrow{\ \varphi_1\ } V_2 \xrightarrow{\ \varphi_2\ } V_3 \xrightarrow{\ \varphi_3\ } 0 \tag{1.7}$$

is exact, it is called a short exact sequence. In this diagram “0” represents the
zero-dimensional vector space, whose only element is the zero vector. The linear
map $\varphi_0$ sends 0 to the zero vector of $V_1$, while $\varphi_3$ sends everything in $V_3$ to the zero
vector.


EXERCISE 1.12 Show that the existence of the short exact sequence (1.7) is
equivalent to the statement “$\varphi_1$ is injective and $\varphi_2$ is surjective.” In particular, if

$$0 \longrightarrow V \longrightarrow W \longrightarrow 0 \tag{1.8}$$

is exact, V and W must be isomorphic.
It follows that if $T : V \to W$ is surjective then

$$0 \longrightarrow \ker T \xrightarrow{\ \iota\ } V \xrightarrow{\ T\ } W \longrightarrow 0 \tag{1.9}$$

is a short exact sequence, where $\iota$ is the inclusion map.⁴ By virtue of the above
discussion, all short exact sequences are of this form.

Theorem 1.2 Given the short exact sequence (1.9), there exists a linear map
$S : W \to V$ such that $T \circ S = 1$. We say that the exact sequence (1.9) splits.

Proof Let $\{f_i\}$ be a basis of W. By the surjectivity of T, for each i there exists an
$e_i \in V$ such that $T(e_i) = f_i$. Let S be the map $f_i \mapsto e_i$ extended by linearity. The
composition of linear maps is linear, so $T \circ S = 1$. (Note that S must therefore be
injective.) □

Remark The map S is called a section of T . 
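
A small worked instance may help fix the idea of a section. The map T and the section S below are illustrative choices made here (they do not come from the text), with $V = \mathbb{R}^2$ and $W = \mathbb{R}$:

```latex
\begin{align*}
T &: \mathbb{R}^2 \to \mathbb{R}, & T(x, y) &= x + y, \\
\ker T &= \{(t, -t) : t \in \mathbb{R}\}, \\
S &: \mathbb{R} \to \mathbb{R}^2, & S(w) &= (w, 0), \\
(T \circ S)(w) &= T(w, 0) = w .
\end{align*}
```

Here T is surjective, so (1.9) is a short exact sequence, and S is one section of T. It is far from the only one: $S'(w) = (0, w)$ also satisfies $T \circ S' = 1$, which reflects the freedom in choosing the preimages $e_i$ in the proof of Theorem 1.2.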

Theorem 1.3 Let the short exact sequence (1.9) be given, and let S be a section
of T. Then

$$V = \ker T \oplus S(W).$$

In particular, $\dim V = \dim \ker T + \dim W$.

Proof Let $v \in V$, and define $w := S(T(v))$ and $u := v - w$. Then $T(u) =
T(v - w) = T(v) - T(S(T(v))) = 0$, so $v = u + w$ with $u \in \ker T$ and $w \in S(W)$.
Now suppose that $x \in \ker T \cap S(W)$. The map S is injective, so $W \cong S(W)$.
In particular, there exists a unique $w \in W$ such that $S(w) = x$. But then $w =
T(S(w)) = T(x) = 0$, so, by the linearity of S, $x = 0$. Hence $V = \ker T \oplus S(W)$.
By Exercise 1.3, $\dim V = \dim \ker T + \dim S(W) = \dim \ker T + \dim W$. □

Remark The conclusion of Theorem 1.3 is commonly referred to as the rank- 
nullity theorem, which states that the rank plus the nullity of any linear map equals 
the dimension of the domain. 
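
The rank-nullity statement can be checked on a concrete matrix. The sketch below is only an illustration under stated assumptions: the helper names `rref` and `kernel_basis` and the sample matrix `T` (a map from $\mathbb{R}^3$ to $\mathbb{R}^2$) are hypothetical choices made here. It row reduces the matrix, reads off the rank from the pivot columns, and builds one kernel vector per free column.

```python
from fractions import Fraction

def rref(matrix):
    """Reduced row echelon form over the rationals; returns (rows, pivot_columns)."""
    m = [[Fraction(x) for x in row] for row in matrix]
    pivots = []
    r = 0  # index of the next pivot row
    for col in range(len(m[0])):
        p = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if p is None:
            continue
        m[r], m[p] = m[p], m[r]
        m[r] = [x / m[r][col] for x in m[r]]      # scale the pivot entry to 1
        for i in range(len(m)):
            if i != r and m[i][col] != 0:
                f = m[i][col]
                m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        pivots.append(col)
        r += 1
        if r == len(m):
            break
    return m, pivots

def kernel_basis(matrix):
    """One basis vector of the kernel for each free (non-pivot) column."""
    m, pivots = rref(matrix)
    n = len(matrix[0])
    basis = []
    for free_col in (c for c in range(n) if c not in pivots):
        v = [Fraction(0)] * n
        v[free_col] = Fraction(1)
        for row, pivot_col in zip(m, pivots):     # the first len(pivots) rows are the pivot rows
            v[pivot_col] = -row[free_col]
        basis.append(v)
    return basis

T = [[1, 2, 3], [2, 4, 6]]        # illustrative matrix of a map R^3 -> R^2
rank_T = len(rref(T)[1])          # dim im T
kernel = kernel_basis(T)
nullity_T = len(kernel)           # dim ker T
```

For this matrix the second row is twice the first, so the rank is 1 and the kernel is two-dimensional, in agreement with $\dim V = 3$.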

EXERCISE 1.13 Show that if $T : V \to W$ is an injective linear map between spaces
of the same dimension then T must be an isomorphism.

EXERCISE 1.14 Let S and T be two endomorphisms of V. Show that $\operatorname{rk}(ST) \le
\min\{\operatorname{rk} S, \operatorname{rk} T\}$.
4 It is almost silly to give this map a name, because it really doesn’t do anything. The idea is that one can view
ker T as a separate vector space as well as a subspace of V, and the inclusion map just sends every element of
the first space to the corresponding element of the second.
Figure 1.1 The quotient space construction. (Figure labels: U, V, V/U.)


1.4 Quotient spaces 

Let V be a vector space, and let U be a subspace of V. We define a natural
equivalence relation⁵ on V by setting $v \sim w$ if $v - w \in U$. The set of equivalence classes
of this equivalence relation is denoted V/U and is called the quotient space of V
modulo U. The (canonical) projection map is the map $\pi : V \to V/U$ given by
$v \mapsto [v]$, where [v] denotes the equivalence class of v in V/U; v is called a class
representative of [v]. The canonical quotient space construction is illustrated in
Figure 1.1. The intuition behind the picture is that two distinct points in V that
differ by an element of U (or, if you wish, by an element of the image of U under
the inclusion map $\iota$) get squashed (or projected) to the same point in the quotient.


Theorem 1.4 If U is a subspace of the vector space V then V/U carries a natural
vector space structure, the projection map is linear, and $\dim(V/U) = \dim V -
\dim U$.


Proof Let $v, w \in V$. Define $[v] + [w] = [v + w]$. We must show that this is well
defined, or independent of class representative, meaning that we would get the
same answer if we chose different class representatives on the left-hand side. But if
$v' \sim v$ and $w' \sim w$ then $v' - v \in U$ and $w' - w \in U$, so $(v' + w') - (v + w) \in U$,
which means that $[v' + w'] = [v + w]$. Next, define $a[v] = [av]$ for some scalar
a. If $v \sim w$ then $v - w \in U$ so that $a(v - w) \in U$, which means that $[av] =
[aw]$. Thus scalar multiplication is also well defined. Hence, V/U equipped with
these operations is a vector space. Observe that the zero vector of V/U is just
the zero class in V, namely the set of all vectors in U. The definitions imply that
$[av + bw] = a[v] + b[w]$, so the projection map is linear. The statement about the
dimensions follows from Theorem 1.3 because

$$0 \longrightarrow U \xrightarrow{\ \iota\ } V \xrightarrow{\ \pi\ } V/U \longrightarrow 0 \tag{1.10}$$

is exact, as $\pi$ is surjective with $\ker \pi = U$. □
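
To make the construction concrete, here is a low-dimensional example. The choice $V = \mathbb{R}^2$ with U the horizontal axis is an illustration made here, not an example from the text:

```latex
\begin{align*}
V &= \mathbb{R}^2, \qquad U = \{(t, 0) : t \in \mathbb{R}\}, \\
(x, y) \sim (x', y') &\iff (x - x',\, y - y') \in U \iff y = y', \\
[(x, y)] &= \{(s, y) : s \in \mathbb{R}\} \quad \text{(the horizontal line at height } y\text{)}, \\
[(x, y)] + [(x', y')] &= [(x + x',\, y + y')], \qquad a\,[(x, y)] = [(ax,\, ay)], \\
\dim(V/U) &= \dim V - \dim U = 2 - 1 = 1 .
\end{align*}
```

Each class is determined by the single number y, and the assignment $[(x, y)] \mapsto y$ is a well-defined linear bijection from V/U to $\mathbb{R}$, consistent with the dimension count in Theorem 1.4.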
Remark We have followed standard usage and called the map $\pi : V \to V/U$ a
projection, but technically it is not a projection in the sense of Exercise 1.9 because,
as we have defined it, V/U is not a subspace of V. For this reason, although you
may be tempted to do so by part (a) of Exercise 1.9, you cannot write $V = U \oplus
V/U$. However, by Theorem 1.2, the exact sequence (1.10) splits, so there is a
linear injection $S : V/U \to V$. If we define $W = S(V/U)$ then, by Theorem 1.3,
$V = U \oplus W$. Conversely, if $V = U \oplus W$ then $W \cong V/U$ (and $U \cong V/W$).

EXERCISE 1.15 Let U be a subspace of V and let W be a subspace of X. Show
that any linear map $\varphi : V \to X$ with $\varphi(U) \subseteq W$ induces a natural linear map
$\bar{\varphi} : V/U \to X/W$. Hint: Set $\bar{\varphi}([v]) = [\varphi(v)]$. Show this is well defined.


1.5 Matrix representations 

Let V be n-dimensional. Given an endomorphism $T : V \to V$ together with a
basis $\{e_i\}$ of V we can construct an $n \times n$ matrix whose entries $T_{ij}$ are given by⁶

$$T e_j = \sum_i e_i T_{ij}. \tag{1.11}$$

One writes $(T_{ij})$ or $\mathbf{T}$ to indicate the matrix whose entries are $T_{ij}$. The map $T \to \mathbf{T}$
is called a representation of T (in the basis $\{e_i\}$). A different choice of basis leads
to a different matrix, but they both represent the same endomorphism.

Let $v = \sum_i v_i e_i \in V$. Then

$$v' := T v = \sum_j v_j T e_j = \sum_{ij} v_j e_i T_{ij} = \sum_i \Big( \sum_j T_{ij} v_j \Big) e_i = \sum_i v'_i e_i,$$

so the components of v' are related to those of v according to the rule

$$v'_i = \sum_j T_{ij} v_j. \tag{1.12}$$
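
Rules (1.11) and (1.12) can be tried out numerically. The following is a minimal sketch, assuming the standard basis of $\mathbb{R}^n$; the helper names `matrix_of` and `apply` and the sample map `T` are hypothetical choices made here. Column j of the matrix holds the components of $T e_j$, and multiplying the matrix into the components of v reproduces $Tv$.

```python
def matrix_of(T, n):
    """Matrix (T_ij) of T in the standard basis of R^n, following (1.11)."""
    columns = []
    for j in range(n):
        e_j = [1 if k == j else 0 for k in range(n)]
        columns.append(T(e_j))                 # components of T e_j
    # entry T_ij is the i-th component of T e_j
    return [[columns[j][i] for j in range(n)] for i in range(n)]

def apply(M, v):
    """Components v'_i = sum_j T_ij v_j, as in (1.12)."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def T(v):
    """An illustrative endomorphism of R^2 (not taken from the text)."""
    x, y = v
    return [2 * x + y, 3 * y]

M = matrix_of(T, 2)
v = [5, -1]
v_prime = apply(M, v)
```

Because the column index labels the basis vector being mapped and the row index labels the component of the result, the matrix acts on components from the left, which is the index placement that footnote 6 asks the reader to note.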

EXERCISE 1.16 Let S and T be two endomorphisms of V. Show that if $S \to \mathbf{S}$
and $T \to \mathbf{T}$ then $ST \to \mathbf{S}\mathbf{T}$, where $ST := S \circ T$, the composition of S and T, and
matrix multiplication is defined by

$$(\mathbf{S}\mathbf{T})_{ij} = \sum_k S_{ik} T_{kj}. \tag{1.13}$$

(This shows why matrix multiplication is defined in the way it is.)


6 Note carefully the placement of the indices.
EXERCISE 1.17 Let $T : V \to V$ be an automorphism represented by $\mathbf{T}$. Show that
$T^{-1} \to \mathbf{T}^{-1}$, where $\mathbf{T}^{-1}$ denotes the inverse matrix.

The row rank (respectively, column rank) of a matrix $\mathbf{T}$ is the maximum number
of linearly independent rows (respectively, columns), when they are considered as
vectors in $\mathbb{R}^n$. The row rank and column rank are always equal, and they equal the
rank $\operatorname{rk} \mathbf{T}$ of $\mathbf{T}$. If $\operatorname{rk} \mathbf{T}$ equals n then we say that $\mathbf{T}$ has maximal rank; otherwise
it is said to be rank deficient.

EXERCISE 1.18 Show that the rank of the endomorphism T equals the rank of the
matrix $\mathbf{T}$ representing it in any basis.


1.6 The dual space 

A linear functional on V is a linear map $f : V \to \mathbb{F}$. The set $V^*$ of all linear
functionals on V is called the dual space (of V), and is often denoted $\mathrm{Hom}(V, \mathbb{F})$.
If f is a linear functional and a is a scalar, af is another linear functional, defined
by $(af)(v) = a f(v)$ (pointwise multiplication). Also, if f and g are two linear
functionals then we can obtain a third linear functional f + g by $(f + g)(v) =
f(v) + g(v)$ (pointwise addition). These two operations turn $V^*$ into a vector space,
and when one speaks of the dual space one always has this vector space structure
in mind.

It is customary to write $\langle v, f \rangle$ or $\langle f, v \rangle$ to denote f(v).⁷ When written this way
it is called the natural pairing or dual pairing between V and $V^*$. Elements of
$V^*$ are often called covectors.

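If $\{e_i\}$ is a basis of V, there is a canonical dual basis or cobasis $\{\theta_j\}$ of $V^*$,
defined by $\langle e_i, \theta_j \rangle = \delta_{ij}$, where $\delta_{ij}$ is the Kronecker delta:

$$\delta_{ij} := \begin{cases} 1 & \text{if } i = j, \text{ and} \\ 0 & \text{otherwise.} \end{cases} \tag{1.14}$$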


Any element $f \in V^*$ can be expanded in terms of the dual basis as

$$f = \sum_i f_i \theta_i, \tag{1.15}$$

where $f_i \in \mathbb{F}$. The scalars $f_i$ are called the components of f with respect to the
basis $\{\theta_i\}$.


EXERCISE 1.19 Show that $\{\theta_j\}$ is indeed a basis for $V^*$.
7 Note the angular shape of the brackets.




It follows from Exercise 1.19 that $\dim V^* = \dim V$. Because they have the same
dimension, V and $V^*$ are isomorphic, but not in any natural way.⁸ On the other
hand, V and $V^{**}$ (the double dual of V) are always isomorphic via the natural
map $v \mapsto (f \mapsto f(v))$.⁹

EXERCISE 1.20 Show that the dual pairing is nondegenerate in the following
sense. If $\langle f, v \rangle = 0$ for all v then $f = 0$ and if $\langle f, v \rangle = 0$ for all f then $v = 0$.

EXERCISE 1.21 Let W be a subspace of V. The annihilator of W, denoted Ann W,
is the set of all linear functionals that map every element of W to zero:

$$\operatorname{Ann} W := \{\theta \in V^* : \theta(w) = 0, \ \text{for all } w \in W\}. \tag{1.16}$$

Show that Ann W is a subspace of $V^*$ and that every subspace of $V^*$ is Ann W for
some W. In the process verify that $\dim V = \dim W + \dim \operatorname{Ann} W$. Hint: For the
second part, let $U^*$ be a subspace of $V^*$ and define

$$W := \{v \in V : f(v) = 0, \ \text{for all } f \in U^*\}.$$

Use this to show that $U^* \subseteq \operatorname{Ann} W$. Equality then follows by a dimension argument.

EXERCISE 1.22 Let W be a subspace of V. Then $(V/W)^* \cong V^*/W^*$ just by
dimension counting, even though there is no natural isomorphism (because there
is no natural isomorphism between a vector space and its dual). But there is an
interesting connection to annihilators that gives an indirect relation between the two
spaces.

(a) Show that $(V/W)^* \cong \operatorname{Ann} W$. Hint: Define a map $\varphi : \operatorname{Ann} W \to (V/W)^*$ by
$f \mapsto \varphi(f)$, where $\varphi(f)([v]) := f(v)$, and extend by linearity. Show that $\varphi$ is
well defined and an isomorphism.

(b) Show that $V^*/\operatorname{Ann} W \cong W^*$. Hint: Let $\pi : V \to W$ be any projection onto W,
and define $\pi^* : V^*/\operatorname{Ann} W \to W^*$ by $[f] \mapsto f \circ \pi$, so $\pi^*([f])(v) = f(\pi(v))$.
Again, show that $\pi^*$ is well defined and an isomorphism.


1.7 Change of basis 

Let $\{e_i\}$ and $\{e'_i\}$ be two bases of V. Each new (primed) basis vector can be written
as a linear combination of the old (unprimed) basis vectors, and by convention we
write

$$e'_j = \sum_i e_i A_{ij} \tag{1.17}$$


8 The situation is different if V is an inner product space. See Section 1.10.

9 When V is infinite dimensional, the situation is more subtle. In that case the statements $V \cong V^*$ and $V \cong V^{**}$
are both generally false. A notable exception occurs when V is a Hilbert space and $V^*$ is defined as the
continuous linear functionals on V. See e.g. [74].
for some nonsingular matrix $A = (A_{ij})$, called the change of basis matrix.¹⁰
By definition, a change of basis leaves all the vectors in V untouched - it merely
changes their description. Thus, if $v = \sum_i v_i e_i$ is the expansion of v relative to the
old basis and $v = \sum_i v'_i e'_i$ is the expansion of v relative to the new basis then

$$\sum_i v_i e_i = \sum_j v'_j e'_j = \sum_{ij} v'_j e_i A_{ij} = \sum_i \Big( \sum_j A_{ij} v'_j \Big) e_i .$$

Hence,

$$v_i = \sum_j A_{ij} v'_j \quad \text{or} \quad v'_i = \sum_j (A^{-1})_{ij} v_j. \tag{1.18}$$

Note that the basis vectors and the components of vectors transform differently 
under a change of basis. 

A change of basis on V induces a change of basis on $V^*$ owing to the
requirement that the natural dual pairing be preserved. Thus, if $\{\theta_i\}$ and $\{\theta'_i\}$ are the dual
bases corresponding to $\{e_i\}$ and $\{e'_i\}$, respectively, then we demand that

$$\langle e'_i, \theta'_j \rangle = \langle e_i, \theta_j \rangle = \delta_{ij}. \tag{1.19}$$

Writing

$$\theta'_j = \sum_i \theta_i B_{ij} \tag{1.20}$$

for some matrix $B = (B_{ij})$ and using (1.19) gives¹¹

$$\delta_{ij} = \langle e'_i, \theta'_j \rangle = \Big\langle \sum_k e_k A_{ki}, \sum_l \theta_l B_{lj} \Big\rangle = \sum_{kl} A_{ki} B_{lj} \langle e_k, \theta_l \rangle = \sum_{kl} A_{ki} B_{lj} \delta_{kl} = \sum_k A_{ki} B_{kj}. \tag{1.21}$$

Writing $A^T$ for the transpose matrix, whose entries are $(A^T)_{ij} := A_{ji}$, (1.21) can
be written compactly as $A^T B = I$, where I is the identity matrix. Equivalently,
we have $B = (A^T)^{-1} = (A^{-1})^T$, the so-called contragredient matrix of A.
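
The relation $A^T B = I$ is easy to verify on a small case. In the sketch below the matrix `A` is an arbitrary invertible $2 \times 2$ example chosen here, and the helpers `transpose`, `matmul`, and `inverse_2x2` are hypothetical names written out in full so that the block is self-contained.

```python
from fractions import Fraction

def transpose(M):
    return [list(col) for col in zip(*M)]

def matmul(X, Y):
    """Matrix product with entries sum_k X_ik Y_kj."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def inverse_2x2(M):
    """Inverse of a 2x2 matrix by the adjugate formula, in exact arithmetic."""
    (a, b), (c, d) = M
    det = Fraction(a * d - b * c)
    if det == 0:
        raise ValueError("matrix is singular")
    return [[d / det, -b / det], [-c / det, a / det]]

A = [[1, 2], [0, 1]]                 # illustrative change of basis matrix
B = transpose(inverse_2x2(A))        # contragredient matrix (A^{-1})^T
pairing = matmul(transpose(A), B)    # entries sum_k A_ki B_kj, as in (1.21)
```

With this choice, `B` comes out as the matrix with rows (1, 0) and (-2, 1), and `pairing` is the identity matrix, so the primed bases are again dual to each other.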

10 Equation (1.17) looks a lot like (1.11), so much so that one can define a linear map $A : V \to V$, called the
change of basis map, given by $e_i \mapsto e'_i$ and extended by linearity. This map can be a bit confusing, though,
for the following reason. In general, a map $T : V \to W$ is represented by a matrix $(T_{ij})$ obtained from
$T e_i = \sum_j T_{ji} f_j$, where $\{e_i\}$ is a basis for V and $\{f_j\}$ is a basis for W. Viewing A as a map from V equipped
with the basis $\{e_i\}$ to V equipped with the basis $\{e'_i\}$, the map A is represented by the identity matrix even
though it is not the identity map. Instead, the change of basis matrix is what we obtain if we use the same
basis $\{e_i\}$ for both copies of V. To avoid these sorts of confusion we shall speak only of the change of basis
matrix when discussing basis changes.

11 Sometimes the last step in (1.21) gives beginners a little trouble. We are using the substitution property of the
Kronecker delta. For example, $\sum_l v_l \delta_{lm} = v_m$, because the Kronecker delta vanishes unless $l = m$. To be
really explicit, if, say, $m = 2$ and the indices run from 1 to 3, then $\sum_l v_l \delta_{l2} = v_1 \delta_{12} + v_2 \delta_{22} + v_3 \delta_{32} = v_2$,
because $\delta_{12} = \delta_{32} = 0$ and $\delta_{22} = 1$.



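If $f \in V^*$ is any covector then under a change of basis we have

$$f = \sum_i f_i \theta_i = \sum_j f'_j \theta'_j = \sum_{ij} f'_j \theta_i B_{ij} = \sum_i \Big( \sum_j B_{ij} f'_j \Big) \theta_i ,$$

from which we conclude that

$$f_i = \sum_j B_{ij} f'_j. \tag{1.22}$$

It is instructive to rewrite (1.20) and (1.22) in terms of the matrix A instead of
the matrix B. Thus, (1.20) becomes

$$\theta'_i = \sum_j \theta_j B_{ji} = \sum_j \theta_j \big( (A^{-1})^T \big)_{ji} = \sum_j (A^{-1})_{ij} \theta_j, \tag{1.23}$$

while (1.22) becomes

$$f'_j = \sum_i (B^{-1})_{ji} f_i = \sum_i (A^T)_{ji} f_i = \sum_i f_i A_{ij}. \tag{1.24}$$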

1.8 Upstairs or downstairs? 

At this point we encounter a serious ambiguity in the notation. How can you tell,
just by looking, that the numbers $a_i$ are the components of a vector or covector
(or neither)? The answer is that you cannot. You must know from the context.
Of course, there is no problem as long as the numbers are accompanied by the
corresponding basis elements $e_i$ or $\theta_i$. In many applications, however, the basis
elements are omitted for brevity.

This ambiguity can be partially ameliorated by introducing a new notational
convention that, while convenient, requires a sharp eye for detail: one changes
the position of the index i. The logic behind this is the following. If one
compares (1.17) with (1.24) and (1.18) with (1.23) one sees something very interesting.
Under a change of basis, the components of a covector transform like the basis
vectors whereas the components of a vector transform like the basis covectors. We say
that the components of a covector transform covariantly (“with the basis vectors”),
whereas the components of a vector transform contravariantly (“against the basis
vectors”).

By convention, we continue to write $e_i$ with a lower index for the basis
vectors but we use a raised index to denote the basis covectors. Thus, we write $\theta^i$
instead of $\theta_i$. Consistency with the aforementioned transformation properties then
requires that vector components be written with upstairs (contravariant) indices and
covector components be written with downstairs (covariant) indices. We therefore
write

$$v = \sum_i v^i e_i \tag{1.25}$$

instead of (1.5) and

$$f = \sum_i f_i \theta^i \tag{1.26}$$

instead of (1.15).

This new notation has the advantage that the transformation properties of the
object under discussion are clear by inspection.$^{12}$ For example, you notice that
in (1.25) and (1.26) the indices are paired one up and one down, then summed.
Whenever this occurs, we say the indices have been contracted. Any expression in
which all the indices are contracted pairwise is necessarily invariant under a change
of basis. So, for example, we see immediately that both $v$ and $f$ are invariant.
Moreover, the positions of the indices act as an aid in computations, for one can
only equate objects with the same transformation properties. One can write an
expression such as $a_i = b^i$ to indicate an equality of numbers, but neither side could
be the components of a vector or covector because the two sides have different
transformation properties under a change of basis. Consequently, such expressions
are to be avoided.

Under this new convention, many of the previous formulae are modified. For 
instance, the natural pairing between vectors and covectors becomes 

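\[ \langle e_i, \theta^j \rangle = \delta_i^j, \tag{1.27} \]

where $\delta_i^j$ is just another way to write the Kronecker symbol:

\[ \delta_i^j := \begin{cases} 1 & \text{if } i = j, \text{ and} \\ 0 & \text{otherwise.} \end{cases} \tag{1.28} \]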


Similarly, the change of basis formulae (1.17) and (1.18) are replaced by

\[ e'_j = \sum_i e_i A^i{}_j \tag{1.29} \]

and

\[ v'^i = \sum_j (A^{-1})^i{}_j v^j, \tag{1.30} \]

respectively, where the first (row) index on the matrix is up and the second (column)
index is down.

12 This becomes especially important when dealing with tensors, which have many more indices.

The problem with this notation is that it is sometimes unnecessarily pedantic, 
and can also lead to awkward formulas. For example, with this same convention, 
(1.20) and (1.22) must be written

\[ \theta'^i = \sum_j \theta^j B_j{}^i \tag{1.31} \]

and

\[ f_i = \sum_j B_i{}^j f'_j, \tag{1.32} \]


respectively. In both these formulae, the first index on the matrix is down and the
second is up. Unless you are very careful about writing such a formula, you can
easily get the wrong result if you place the indices in the wrong locations. This can
be avoided by using (1.23) instead of (1.20) and (1.24) instead of (1.22) and then
raising the appropriate indices. It can also be avoided by adopting a convenient
matrix notation, which we will have occasion to use later. Write $A = (A^i{}_j)$ and
introduce the shorthand notation

\[ e = (e_1 \ e_2 \ \cdots \ e_n), \qquad \theta = \begin{pmatrix} \theta^1 \\ \theta^2 \\ \vdots \\ \theta^n \end{pmatrix}, \tag{1.33} \]

\[ v = \begin{pmatrix} v^1 \\ v^2 \\ \vdots \\ v^n \end{pmatrix}, \qquad f = (f_1 \ f_2 \ \cdots \ f_n). \tag{1.34} \]


w 

Then (1.17), (1.18), (1.23), and (1.24) can be written

\[ e' = eA, \tag{1.35} \]

\[ v' = A^{-1} v, \tag{1.36} \]

\[ \theta' = A^{-1} \theta, \tag{1.37} \]

and

\[ f' = fA, \tag{1.38} \]

respectively. This notation makes the invariance of $v$ and $f$ under a change of basis
easy to see. For example, $v' = e'v' = eAA^{-1}v = ev = v$.
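The matrix forms (1.36) and (1.38) can be checked numerically. The following is a minimal sketch, not taken from the text: the matrix `A` and the component arrays `v` and `f` are arbitrary illustrative choices, and the helper names are hypothetical. It shows that the contraction $fv$ is unchanged when the components are transformed contravariantly and covariantly.

```python
from fractions import Fraction

def matmul(X, Y):
    """Multiply two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def inverse_2x2(M):
    """Inverse of a 2 x 2 matrix via the adjugate formula."""
    (a, b), (c, d) = M
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

# Hypothetical change-of-basis matrix A (illustrative values only).
A = [[Fraction(2), Fraction(1)],
     [Fraction(1), Fraction(1)]]
A_inv = inverse_2x2(A)

v = [[Fraction(3)], [Fraction(-1)]]   # vector components: a column
f = [[Fraction(4), Fraction(5)]]      # covector components: a row

v_new = matmul(A_inv, v)              # v' = A^{-1} v  (contravariant)
f_new = matmul(f, A)                  # f' = f A       (covariant)

pairing_old = matmul(f, v)[0][0]          # f v
pairing_new = matmul(f_new, v_new)[0][0]  # f' v' = f A A^{-1} v
print(pairing_old, pairing_new)           # both equal 7
```

Exact rational arithmetic is used so that the two pairings agree identically rather than up to rounding.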
Although there are times when it is necessary to write all the indices in their
proper places, there are many times when it is not. Unless we are specifically con- 
cerned with the transformation properties of the quantities in an equation, it is not 
necessary to worry too much about index placement. Consequently, although we 
will mostly adhere to this notational convention, there will be occasions when we 
will not. Nonadherence will cause no problems, though, because precisely in those 
instances the transformation properties of the various intermediate quantities will 
be irrelevant to the final result. 


1.9 Inner product spaces 

So far everything we have said about vector spaces and linear maps works in 
essentially the same way regardless of the underlying field $\mathbb{F}$, but there are
a few places where the underlying field matters and here we consider one of them. 
For simplicity we mostly restrict ourselves to real fields in this book, but in the 
next three sections we discuss complex fields explicitly in order to save time later. 
The advantage of working over the complex numbers is that the real numbers are 
included in the complex numbers, so expressions valid for the complex numbers 
can usually be specialized to the real numbers without any additional effort. 

Thus, let $\mathbb{F}$ be a subfield of $\mathbb{C}$, and let $V$ be a vector space over $\mathbb{F}$. A sesquilinear
form on $V$ is a map $g : V \times V \to \mathbb{F}$ satisfying the following two properties. For
all $u, v, w \in V$ and $a, b \in \mathbb{F}$, the map $g$ is

(1) linear on the second entry: $g(u, av + bw) = a\,g(u, v) + b\,g(u, w)$, and

(2) Hermitian: $g(v, u) = \overline{g(u, v)}$,

where $\bar{a}$ is the complex conjugate of $a$. Note that these two properties together
imply that $g$ is antilinear on the first entry:

\[ g(au + bv, w) = \bar{a}\,g(u, w) + \bar{b}\,g(v, w). \]

However, if $\mathbb{F}$ is a real field (a subfield of $\mathbb{R}$) then $\bar{a} = a$ and $\bar{b} = b$ and the above
condition just says that $g$ is linear on the first entry as well. In that case
we say that $g$ is a bilinear form. Moreover, the Hermiticity condition becomes
the symmetry condition $g(u, v) = g(v, u)$, so a real sesquilinear form is in fact a
symmetric bilinear form.$^{13}$ If the sesquilinear form $g$ is

(3) nondegenerate, so that $g(u, v) = 0$ for all $v$ implies $u = 0$,

then it is called an inner product. A vector space equipped with an inner product
is called an inner product space.
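Properties (1) and (2), and the antilinearity they imply, can be tested on a concrete form. The sketch below assumes the standard form on $\mathbb{C}^2$ that conjugates the first argument; the vectors, scalars and helper names are illustrative choices rather than anything fixed by the text.

```python
def g(u, v):
    """Standard inner product on C^n: the FIRST argument is conjugated."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def add(u, v):
    return [a + b for a, b in zip(u, v)]

def scale(c, u):
    return [c * a for a in u]

# Arbitrary illustrative vectors in C^2 and scalars in C.
u = [1 + 2j, 3 - 1j]
v = [2 - 1j, 0 + 1j]
w = [-1 + 1j, 4 + 0j]
a, b = 2 + 1j, 0 - 3j

# (1) linearity on the second entry
second_lhs = g(u, add(scale(a, v), scale(b, w)))
second_rhs = a * g(u, v) + b * g(u, w)

# antilinearity on the first entry (a consequence of (1) and (2))
first_lhs = g(add(scale(a, u), scale(b, v)), w)
first_rhs = a.conjugate() * g(u, w) + b.conjugate() * g(v, w)

# (2) Hermiticity, which also forces g(u, u) to be real
hermitian_gap = g(v, u) - g(u, v).conjugate()

print(abs(second_lhs - second_rhs), abs(first_lhs - first_rhs), abs(hermitian_gap))
print(g(u, u))  # (15+0j): real, as Hermiticity requires
```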
13 The prefix “sesqui” means “one and a half”, which is appropriate because a sesquilinear form is in some sense
halfway between linear and bilinear.


By Hermiticity $g(u, u)$ is always a real number. We may thus distinguish some
important subclasses of inner products. If $g(u, u) \ge 0$ (respectively, $g(u, u) \le 0$)
for all $u$ then $g$ is nonnegative definite (respectively, nonpositive definite). A nonnegative
definite (respectively, nonpositive definite) inner product satisfying the condition
that

\[ g(u, u) = 0 \text{ implies } u = 0 \]

is positive definite (respectively, negative definite). A positive definite or negative
definite inner product is always nondegenerate but the converse is not true, as you
can verify by doing Exercise 1.23.

Example 1.2 The standard example of a complex vector space (a vector space
over the complex numbers) is $\mathbb{C}^n$, the Cartesian product of $\mathbb{C}$ with itself $n$ times:
$\mathbb{C} \times \cdots \times \mathbb{C}$. A vector in $\mathbb{C}^n$ is just an $n$-tuple $(a_1, \dots, a_n)$ of complex numbers, with
scalar multiplication and vector addition defined componentwise as in the real case.
If $u = (u_1, \dots, u_n)$ and $v = (v_1, \dots, v_n)$ then the standard inner product is given by

\[ g(u, v) := \sum_{i=1}^{n} \bar{u}_i v_i. \]

Example 1.3 (The Euclidean inner product or dot product on $\mathbb{R}^n$) If
$u = (u_1, u_2, \dots, u_n)$ and $v = (v_1, v_2, \dots, v_n)$ then we may define

\[ g(u, v) := \sum_{i=1}^{n} u_i v_i. \tag{1.39} \]

The vector space $\mathbb{R}^n$ equipped with the inner product (1.39) is usually denoted $\mathbb{E}^n$
and is called Euclidean space. People often fail to distinguish between $\mathbb{R}^n$ and $\mathbb{E}^n$,
because the Euclidean inner product is the usual default. But, as the next example
shows, we ought not be quite so cavalier.

Example 1.4 (The Lorentzian inner product on $\mathbb{R}^n$) Let $u = (u_0, u_1, \dots, u_{n-1})$
and $v = (v_0, v_1, \dots, v_{n-1})$, and define

\[ g(u, v) := -u_0 v_0 + \sum_{i=1}^{n-1} u_i v_i. \tag{1.40} \]

The vector space $\mathbb{R}^n$ equipped with the inner product (1.40) is usually denoted $\mathbb{M}^n$
and is called Minkowski space (or, more properly, Minkowski spacetime). The
inner product space $\mathbb{M}^4$ is the domain of Einstein’s special theory of relativity.

EXERCISE 1.23 Verify that the standard inner product on $\mathbb{C}^n$ and the Euclidean
and Lorentzian inner products are indeed inner products, but that only the first two
are positive definite. (The Lorentzian inner product is indefinite because $g(v, v)$
may be positive, zero, or negative.) Note that a positive definite inner product is
automatically nondegenerate (just take $v = u$ in the definition); this exercise shows
that the converse is false.
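To see the indefiniteness concretely, the sketch below evaluates (1.40) on three nonzero vectors of $\mathbb{M}^4$. The labels `timelike`, `null` and `spacelike` are the usual physics names, introduced here only for illustration; the specific vectors are arbitrary choices.

```python
def lorentz(u, v):
    """Lorentzian inner product (1.40): minus sign on the zeroth components."""
    return -u[0] * v[0] + sum(a * b for a, b in zip(u[1:], v[1:]))

# Three nonzero vectors in M^4 (labels follow common physics usage).
timelike = (2, 1, 0, 0)
null = (1, 1, 0, 0)
spacelike = (1, 2, 0, 0)

values = [lorentz(x, x) for x in (timelike, null, spacelike)]
print(values)  # [-3, 0, 3]: negative, zero and positive

# The null vector has g(u, u) = 0 without being orthogonal to everything,
# so nondegeneracy is not violated.
e0 = (1, 0, 0, 0)
print(lorentz(null, e0))  # -1
```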

A set $\{v_i\}$ of vectors is orthogonal if

\[ g(v_i, v_j) = 0 \quad \text{whenever } i \neq j, \tag{1.41} \]

and it is orthonormal if

\[ g(v_i, v_j) = \pm\delta_{ij}. \tag{1.42} \]

A vector $v$ satisfying $g(v, v) = \pm 1$ is called a unit vector.

Example 1.5 The standard basis vectors $\{e_i = (0, \dots, 1, \dots, 0)\}$ of $\mathbb{R}^n$ form an
orthonormal basis for $\mathbb{E}^n$ as well as $\mathbb{M}^n$.

Not only do orthonormal sets of vectors exist, but we can always find enough of 
them to span the space, as the next result shows. 

Theorem 1.5 Every inner product space has an orthonormal basis. 

We offer two proofs of this fact, because both are instructive and interesting. The 
first is more “abstract”, using only the tools introduced so far, while the second is 
more “algebraic” and uses the idea of diagonalizability. 

First proof of Theorem 1.5 We use induction on $k = \dim V$. If $\dim V = 1$ then
there must be at least one nonzero vector $v \in V$ such that $g(v, v) \neq 0$. Otherwise,
for all $v, w \in V$,

\[ 0 = g(v + w, v + w) = g(v, v) + 2g(v, w) + g(w, w) = 2g(v, w), \tag{1.43} \]

which contradicts the nondegeneracy condition. So $v/\sqrt{|g(v, v)|}$ is an orthonormal
basis for $V$.$^{14}$ Now suppose that we have found $k - 1$ orthonormal vectors
$e_1, e_2, \dots, e_{k-1}$ in $V$. They span a $(k - 1)$-dimensional subspace $W$. Let $\pi : V \to W$
be the orthogonal projection map given by

\[ \pi(v) = \sum_{i=1}^{k-1} g(e_i, v)\, e_i. \tag{1.44} \]

It is linear by construction and surjective because $\pi(w) = w$ for all $w \in W$. Thus
we get an exact sequence

\[ 0 \longrightarrow W^\perp \longrightarrow V \xrightarrow{\ \pi\ } W \longrightarrow 0, \]

where

\[ W^\perp = \ker \pi = \{ v \in V : g(v, w) = 0 \text{ for all } w \in W \} \tag{1.45} \]

is the orthogonal complement of $W$. By Theorem 1.3, $V = W \oplus W^\perp$ (where we
have identified $W$ with its image under the inclusion map, which is a section of
$\pi$), so every $v \in V$ can be written uniquely as $v = w + w^\perp$ for some $w \in W$
and $w^\perp \in W^\perp$. The inner product on $V$ induces an inner product on $W^\perp$. We need
only show that this induced inner product is nondegenerate, for then by the above
argument there is a unit vector in $W^\perp$ which by construction is orthogonal to all the
$e_i$. Let $w' \in W^\perp$ be fixed. Suppose $g(w', w^\perp) = 0$ for all $w^\perp \in W^\perp$. Then, for any
$v \in V$, $g(w', v) = g(w', w + w^\perp) = g(w', w) = 0$, so we must have $w' = 0$. □

14 The absolute value of $x$ is denoted $|x|$.

Below we offer a second proof of Theorem 1.5, which is valid only for real
fields, in order to define the concept of the signature of a real inner product. Recall
that the determinant of a matrix $(A_{ij})$ is defined by

\[ \det A := \sum_{\sigma \in \mathfrak{S}_n} (-1)^\sigma A_{1\sigma(1)} A_{2\sigma(2)} \cdots A_{n\sigma(n)}, \tag{1.46} \]

where $\mathfrak{S}_n$ is the set of all permutations of $n$ elements and $(-1)^\sigma$ denotes the sign of
the permutation $\sigma$.$^{15}$ Given a real symmetric bilinear form $g$ and a set $\{v_1, \dots, v_n\}$
of vectors we can construct a real symmetric $n \times n$ matrix $G := (g(v_i, v_j))$,
called the Gram matrix of the set. The determinant of this matrix is called a Gram
determinant, or Grammian, of the set of vectors.

The following lemma will come in handy. 

Lemma 1.6 Let $V$ be a real vector space with basis $\{e_i\}$, and let $g$ be a real
symmetric bilinear form. Let $g_{ij} := g(e_i, e_j)$ and set $G = (g_{ij})$. Then $g$ is an inner
product if and only if the Grammian $\det G \neq 0$.

Proof Let $v = \sum_i a_i e_i$, and suppose that $g(v, w) = 0$ for all $w$. Then, for all $j$,

\[ 0 = \sum_i a_i\, g(e_i, e_j) = \sum_i a_i g_{ij}. \tag{1.47} \]

If $\det(g_{ij}) = 0$ then these equations have a nontrivial solution for the $a_i$, whereas
if $\det(g_{ij}) \neq 0$ then all the $a_i$ vanish, so $v = 0$. (See Exercise 1.34.) □
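As an illustration of Lemma 1.6, the sketch below computes the Grammian of one fixed basis of $\mathbb{R}^2$ with respect to three symmetric bilinear forms. The forms, the basis and the helper names are illustrative assumptions; the third form is deliberately degenerate.

```python
def gram(g, vectors):
    """Gram matrix (g(v_i, v_j)) of a list of vectors."""
    return [[g(u, v) for v in vectors] for u in vectors]

def det2(M):
    return M[0][0] * M[1][1] - M[0][1] * M[1][0]

def euclid(u, v):
    return u[0] * v[0] + u[1] * v[1]

def lorentz(u, v):
    return -u[0] * v[0] + u[1] * v[1]

def degenerate(u, v):
    # Symmetric and bilinear, but (0, 1) is orthogonal to every vector.
    return u[0] * v[0]

basis = [(1, 1), (1, -1)]  # an illustrative basis of R^2

grammians = {form.__name__: det2(gram(form, basis))
             for form in (euclid, lorentz, degenerate)}
print(grammians)  # {'euclid': 4, 'lorentz': -4, 'degenerate': 0}
```

Note that both basis vectors are null for the Lorentzian form, so its Gram matrix has zeros on the diagonal, yet the Grammian is nonzero and the form is an inner product.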

Second proof of Theorem 1.5$^{16}$ Pick a basis $\{e_i\}_{i=1}^n$ for $V$. The Gram matrix
$G = (g(e_i, e_j))$ is a real symmetric matrix. By the spectral theorem of linear
algebra (see Appendix B), there exists an orthogonal matrix $R$ such that
$\Lambda := R^{-1} G R = R^T G R$ is a diagonal matrix. (The diagonal elements $\lambda_i := \Lambda_{ii}$
of $\Lambda$ are the eigenvalues of $G$ and the columns of $R$ are the eigenvectors of $G$.) By
Lemma 1.6 all the eigenvalues are nonzero (because the determinant is the product
of its eigenvalues). Define a new diagonal matrix $B$ with $B_{ii} = |\lambda_i|^{-1/2}$. Then
$D := B^T \Lambda B$ is a diagonal matrix with $D_{ii} = \pm 1$. Tracing back the definitions
gives $D = M^T G M$, where $M := RB$. Define vectors $v_i := \sum_k M_{ki} e_k$. Then

\[ g(v_i, v_j) = \sum_{k,l} M_{ki} M_{lj}\, g(e_k, e_l) = (M^T G M)_{ij} = D_{ij}. \]

It follows that $v_1, \dots, v_n$ are the orthonormal vectors we seek. □

15 For the reader's convenience, some basic facts about permutations are collected in Appendix A. For more
about determinants, see the additional exercises at the end of this chapter as well as Section 2.8.

16 For the terminology used herein see the additional exercises at the end of this chapter.

By reordering the basis elements, if necessary, we may write the diagonal matrix
appearing above as

\[ D = \operatorname{diag}(\underbrace{+1, \dots, +1}_{p}, \underbrace{-1, \dots, -1}_{q}), \]

where there are $p$ plus signs and $q$ minus signs. The signature of the inner product
$g$ may refer to any of the following: (1) the pair $(p, q)$, (2) the number $p - q$, or
(3) the signs of the diagonal elements themselves.$^{17}$ The numbers $p$ and $q$ depend
only on the real bilinear form $g$ and not on the basis used to define the Gram matrix
$G$, a result known as Sylvester’s law of inertia.$^{18}$

Example 1.6 Let $\{e_i\}$ be the standard basis of $\mathbb{R}^n$. Then the Gram matrix elements
of the Euclidean inner product are given by

\[ g_{ij} = g(e_i, e_j) = \delta_{ij}, \tag{1.48} \]

so $G$ is already diagonal, with ones along the diagonal and zeros elsewhere. Hence
this inner product has signature $n$ or $(+ + \cdots +)$. Similarly, in that same basis, the
Gram matrix elements of the Lorentzian inner product are

\[ g_{ij} = g(e_i, e_j) = \eta_{ij}, \tag{1.49} \]

where

\[ \eta_{ij} := \begin{cases} -1 & \text{if } i = j = 0, \\ +1 & \text{if } i = j \neq 0, \text{ and} \\ 0 & \text{otherwise.} \end{cases} \tag{1.50} \]

Hence, the Lorentzian inner product has signature $n - 2$ or $(- + + \cdots +)$.

17 Definitions (1) and (2) are sometimes preferred, because there is no material difference between, say,
$(- + + \cdots +)$ and $(+ + \cdots - +)$. Indeed, only $|p - q|$ is really of interest, because, for example, inner product
spaces with signature $(- + + \cdots +)$ are essentially equivalent to spaces with signature $(+ - \cdots -)$.

18 The proof of Sylvester’s law is rather subtle. The second proof of Theorem 1.5 shows that under a change of
basis effected by a matrix $A$, the Gram matrix $G$ changes by a congruence:

\[ G \to G' = A^T G A. \]

But a congruence does not generally preserve eigenvalues, so the spectra of $G'$ and $G$ are not necessarily the
same. (In contradistinction, a similarity transformation $G' = A^{-1} G A$ does preserve the spectrum. See
Exercise 1.43.) The content of Sylvester’s law is that the signs of the eigenvalues are preserved by a congruence.
For a proof see [28], Chapter X, §2, Theorem 1 or [31], Section 9.9.
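Sylvester’s law can be observed in the smallest nontrivial case. The following sketch assumes a two-dimensional Lorentzian Gram matrix and an arbitrary invertible matrix `A` (an illustrative choice); it uses the closed-form eigenvalues of a real symmetric $2 \times 2$ matrix. The congruence $A^T G A$ changes the eigenvalues but not their signs.

```python
import math

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    return [list(row) for row in zip(*X)]

def eigenvalues_sym_2x2(M):
    """Eigenvalues of a real symmetric 2 x 2 matrix from its trace and determinant."""
    tr = M[0][0] + M[1][1]
    det = M[0][0] * M[1][1] - M[0][1] * M[1][0]
    disc = math.sqrt(tr * tr - 4 * det)  # nonnegative for symmetric M
    return ((tr - disc) / 2, (tr + disc) / 2)

def signature(M):
    """The pair (p, q): numbers of positive and negative eigenvalues."""
    lam = eigenvalues_sym_2x2(M)
    return (sum(1 for x in lam if x > 0), sum(1 for x in lam if x < 0))

eta = [[-1, 0], [0, 1]]   # Lorentzian Gram matrix in the standard basis
A = [[2, 1], [1, 1]]      # hypothetical change-of-basis matrix

G_new = matmul(matmul(transpose(A), eta), A)   # congruence G' = A^T G A
print(G_new)                                   # [[-3, -1], [-1, 0]]
print(eigenvalues_sym_2x2(eta), eigenvalues_sym_2x2(G_new))
print(signature(eta), signature(G_new))        # (1, 1) (1, 1)
```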


1.10 The Riesz lemma 

In Section 1.6 we mentioned that, in general, there is no natural isomorphism
between a vector space $V$ and its dual space $V^*$. Things are different if there is
an inner product around. In that case there is a natural isomorphism $\psi : V \to V^*$,
and this is the content of the Riesz lemma. Actually, things are more subtle than
that, because there is a difference between the real and the complex case. The real
case is the most important for our purposes, but we need to discuss the complex
case as well in order to understand adjoint maps.

Define a map $\psi : V \to V^*$ by $v \mapsto f_v$, where

\[ f_v(\cdot) := g(v, \cdot). \tag{1.51} \]

Equivalently, for all $w \in V$, $f_v(w) = g(v, w)$.

Lemma 1.7 (Riesz) If $\mathbb{F} = \mathbb{R}$ then the map $\psi$ is an isomorphism, whereas if
$\mathbb{F} = \mathbb{C}$ then the map $\psi$ is an antiisomorphism.$^{19}$

Proof$^{20}$ Assume the field is complex. First observe that $\psi$ really does map to $V^*$,
because $g$ is always linear on the second entry. Next we have

\[ \psi(au + bv)(w) = g(au + bv, w) = \bar{a}\,g(u, w) + \bar{b}\,g(v, w) = (\bar{a}\,\psi(u) + \bar{b}\,\psi(v))(w), \]

so $\psi$ is antilinear in general. Clearly it becomes linear when the field is real. By
nondegeneracy of the inner product it follows that $\ker \psi = 0$, so $\psi$ is injective.
But $V$ and $V^*$ have the same dimension, so by the rank-nullity theorem, $\psi$ is also
surjective. □
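In components, the real Riesz map is just multiplication by the Gram matrix. The sketch below is an illustration under assumptions not made in the text: it works in $\mathbb{M}^4$ with the standard basis, so that the Gram matrix is $\eta$, and the vectors `v` and `w` and the helper names are arbitrary.

```python
def g(G, v, w):
    """Real bilinear form with Gram matrix G: g(v, w) = sum_ij v^i g_ij w^j."""
    n = len(v)
    return sum(v[i] * G[i][j] * w[j] for i in range(n) for j in range(n))

def riesz(G, v):
    """Components (f_v)_j = sum_i v^i g_ij of the covector f_v = g(v, .)."""
    n = len(v)
    return [sum(v[i] * G[i][j] for i in range(n)) for j in range(n)]

def pair(f, w):
    """Dual pairing of covector components f with vector components w."""
    return sum(fj * wj for fj, wj in zip(f, w))

eta = [[-1, 0, 0, 0],
       [0, 1, 0, 0],
       [0, 0, 1, 0],
       [0, 0, 0, 1]]

v = [1, 2, 3, 4]
w = [5, 6, 7, 8]

f_v = riesz(eta, v)
print(f_v)                          # [-1, 2, 3, 4]
print(pair(f_v, w), g(eta, v, w))   # 60 60
```

Because $\eta$ is its own inverse, applying `riesz(eta, ...)` a second time recovers `v`; for a general Gram matrix one would need the inverse matrix instead.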

19 An antiisomorphism between two complex vector spaces - sometimes called a conjugate linear isomor- 
phism - is just an antilinear bijection. 

20 We only show this in finite dimensions, where the proof is easy. Riesz’s actual lemma is valid for Hilbert 
spaces, which are a special class of infinite-dimensional vector spaces. For another proof of Lemma 1.7, see 
Exercise 1.54. 
1.11 Adjoint maps, transpose maps, and duality


Let $V$ and $W$ be two vector spaces. If $A : V \to W$ is a linear map, there is an
induced linear map of dual spaces $A^* : W^* \to V^*$, given by

\[ (A^* f)(v) = f(Av) \tag{1.52} \]

or, equivalently,

\[ \langle A^* f, v \rangle = \langle f, Av \rangle \tag{1.53} \]

for all $f \in W^*$ and $v \in V$. (The dual pairing on the left connects $V^*$ and $V$,
while that on the right connects $W^*$ and $W$.) The map $A^*$ has many names in the
literature: “adjoint”, “transpose”, “pullback”, etc. Unfortunately, all these words
have other meanings as well. In this section we’ll discuss the reason for the first
two names (one of which is not really correct), while the third name is discussed in
the context of more general maps in Section 3.14. If we need to refer to $A^*$, we’ll
simply call it the dual map of $A$.

EXERCISE 1.24 Show that A* is linear. 

For simplicity let us now restrict our attention to the case $V = W$.$^{21}$ The next
exercise explains why the name “transpose” is sometimes used for the dual map
(and also shows that $A^*$ is uniquely defined by $A$).

EXERCISE 1.25 Let $\{e_i\}$ be a basis for $V$ with corresponding dual basis $\{e_i^*\}$. Show
that if $A$ is represented by the matrix $\mathbf{A}$ relative to $\{e_i\}$ then $A^*$ is represented by the
transpose matrix $\mathbf{A}^T$ relative to $\{e_i^*\}$.

To explain the name “adjoint” we suppose that $V$ is equipped with an inner
product $g$. In that case, the map $A : V \to V$ induces another map $A^\dagger : V \to V$,
called the adjoint of $A$, by the requirement that

\[ g(A^\dagger w, v) = g(w, Av), \tag{1.54} \]

for all $v, w \in V$.

EXERCISE 1.26 Given a matrix $\mathbf{A}$ the adjoint matrix $\mathbf{A}^\dagger$ is defined to be the
conjugate transpose matrix $\bar{\mathbf{A}}^T$, namely the matrix obtained by conjugating every entry
and taking the transpose. Show that if the linear map $A$ is represented by the matrix $\mathbf{A}$
in some orthonormal basis $\{e_i\}$ then its adjoint map $A^\dagger$ is represented by the adjoint
matrix $\mathbf{A}^\dagger$ with respect to the same basis.

21 The following results generalize easily to the case in which $V \neq W$, but to do so we would have to keep track
of four bases rather than just two.

Now here is the point. When the underlying field is real, conjugation is trivial and
the adjoint map is represented by the transpose matrix (assuming an orthonormal
basis). It is for this reason that some authors refer to the dual map as the adjoint
map. But in general they are different, so you should really avoid using the same
terminology for both.
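The difference between the adjoint and the transpose shows up as soon as the entries are complex. The following sketch assumes the standard inner product on $\mathbb{C}^2$ in the standard (orthonormal) basis; the matrix and vectors are arbitrary illustrative values. The conjugate transpose satisfies (1.54), while the plain transpose does not.

```python
def g(u, v):
    """Standard inner product on C^n: the first argument is conjugated."""
    return sum(a.conjugate() * b for a, b in zip(u, v))

def apply(M, v):
    """Matrix times column vector."""
    return [sum(M[i][j] * v[j] for j in range(len(v))) for i in range(len(M))]

def transpose(M):
    n = len(M)
    return [[M[j][i] for j in range(n)] for i in range(n)]

def conj_transpose(M):
    n = len(M)
    return [[M[j][i].conjugate() for j in range(n)] for i in range(n)]

# Illustrative complex matrix and vectors.
A = [[1 + 1j, 2 + 0j],
     [0 + 0j, 3 - 2j]]
v = [1 + 0j, 0 + 1j]
w = [2 - 1j, 1 + 1j]

rhs = g(w, apply(A, v))                          # g(w, A v)
lhs_adjoint = g(apply(conj_transpose(A), w), v)  # g(A^dagger w, v)
lhs_transpose = g(apply(transpose(A), w), v)     # g(A^T w, v)

print(rhs, lhs_adjoint, lhs_transpose)  # (4+8j) (4+8j) (2+8j)
```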


Additional exercises 

1.27 Linear independence Let $\{v_1, \dots, v_n\}$ be a linearly independent set of vectors.
Show that the vectors $v_1, \dots, v_n, w$ are linearly dependent if and only if
$w$ is a linear combination of $v_1, \dots, v_n$.

1.28 Vector spaces of polynomials Let $V$ be the vector space of all polynomials
in one variable $x$ of degree at most 3, where the vector space operations are
defined pointwise: $(f + g)(x) = f(x) + g(x)$ and $(af)(x) = af(x)$. Show
that the monomials $1$, $x$, $x^2$, and $x^3$ form a basis for $V$.

1.29 The cokernel Suppose that $T : V \to W$ is linear. The kernel of $T$ is the set
of all vectors in $V$ that are sent to zero by $T$. It measures the failure of $T$ to
be injective. The failure of $T$ to be surjective is measured by the cokernel of
$T$. But the cokernel is not just the set of vectors in $W$ that are not in the image
of $T$. Instead, $\operatorname{coker} T$ is defined to be the quotient space $W / \operatorname{im} T$. Show that
this is in some sense “natural”, because it makes the sequence

\[ 0 \longrightarrow \ker T \longrightarrow V \xrightarrow{\ T\ } W \longrightarrow \operatorname{coker} T \longrightarrow 0 \tag{1.55} \]

exact.

1.30 An alternating sum If

\[ 0 \longrightarrow V_0 \longrightarrow V_1 \longrightarrow \cdots \longrightarrow V_n \longrightarrow 0 \]

is an exact sequence of vector spaces, show that

\[ \sum_{i=0}^{n} (-1)^i \dim V_i = 0. \]

Hint: Use exactness and the rank-nullity theorem.

1.31 Determinant of the transpose Starting from the definition (1.46), show
that, for any matrix $A$,

\[ \det A = \det A^T. \tag{1.56} \]

Hint: The sign of a permutation equals that of its inverse.
1.32 The Laplace expansion of the determinant Let $A = (A_{ij})$ and let $A(i|j)$
denote the matrix obtained from $A$ by deleting the $i$th row and $j$th column of
$A$. The minor of the element $A_{ij}$ is $\det A(i|j)$. The signed minor or cofactor
of $A_{ij}$ is $\tilde{A}_{ij} := (-1)^{i+j} \det A(i|j)$. Prove the Laplace expansion formula
for the determinant: that is, for any $i$,

\[ \det A = \sum_{j=1}^{n} A_{ij} \tilde{A}_{ij} \tag{1.57} \]

(the Laplace expansion by the $i$th row) or, for any $j$,

\[ \det A = \sum_{i=1}^{n} A_{ij} \tilde{A}_{ij} \tag{1.58} \]

(the Laplace expansion by the $j$th column). Hint: Starting from (1.46), show
that the coefficient of $A_{11}$ in $\det A$ is just $\det A(1|1) = \tilde{A}_{11}$. By executing row
and column flips, show that the coefficient of $A_{ij}$ in $\det A$ is $\tilde{A}_{ij}$. Here you
need to use the fact, easily proved from the definition of $\det A$, that switching
any two rows or columns of a matrix flips the sign of its determinant. Next,
consider the sum (1.57) with $i = 1$, say, and show that it contains each term
of $\det A$ once and only once.

1.33 The adjugate and the inverse The transpose of the matrix of cofactors is
called the adjugate of $A$, written $\operatorname{adj} A$. Thus, $(\operatorname{adj} A)_{ij} = \tilde{A}_{ji}$. Show that

\[ A(\operatorname{adj} A) = (\operatorname{adj} A)A = (\det A) I, \tag{1.59} \]

where $I$ is the identity matrix. Hint: We have

\[ [A(\operatorname{adj} A)]_{ik} = \sum_{j=1}^{n} A_{ij} (\operatorname{adj} A)_{jk} = \sum_{j=1}^{n} A_{ij} \tilde{A}_{kj}. \tag{1.60} \]

Now think about the two cases $i = k$ and $i \neq k$.

1.34 Invertibility and the determinant A matrix $A$ is singular if $\det A = 0$
and nonsingular otherwise. Show that $A$ is invertible if and only if it is
nonsingular, and thereby show that $Av = 0$ has a nontrivial solution for $v$ if and
only if $\det A = 0$. Big hint: Use (1.59).

1.35 Cramer’s rule Let $A = (A_{ij})$ be a nonsingular matrix, let
$x = (x_1, \dots, x_n)^T$ be a vector of indeterminates, let $b$ be a fixed vector, and
consider the simultaneous set of linear equations $Ax = b$. Prove Cramer’s rule,
namely that $x_i = \det A^{(i)} / \det A$, where $A^{(i)}$ is the matrix $A$ with the $i$th
column replaced by the vector $b$. Hint: Use the adjugate.
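The statements of Exercises 1.32, 1.33 and 1.35 can be checked on a small example before attempting the proofs. The sketch below is only a numerical illustration with an arbitrary integer matrix `A` and right-hand side `b`; the function names are hypothetical. It computes the determinant by Laplace expansion along the first row, builds the adjugate from cofactors, and solves $Ax = b$ by Cramer’s rule.

```python
from fractions import Fraction

def minor_matrix(A, i, j):
    """A(i|j): delete row i and column j (0-based indices)."""
    return [row[:j] + row[j + 1:] for k, row in enumerate(A) if k != i]

def det(A):
    """Determinant by Laplace expansion along the first row, as in (1.57)."""
    if len(A) == 1:
        return A[0][0]
    return sum((-1) ** j * A[0][j] * det(minor_matrix(A, 0, j))
               for j in range(len(A)))

def adjugate(A):
    """(adj A)_ij is the cofactor of A_ji (transpose of the cofactor matrix)."""
    n = len(A)
    return [[(-1) ** (i + j) * det(minor_matrix(A, j, i)) for j in range(n)]
            for i in range(n)]

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def cramer(A, b):
    """Solve A x = b via x_i = det A^(i) / det A (A must be nonsingular)."""
    d = det(A)
    solution = []
    for i in range(len(A)):
        A_i = [row[:i] + [b[k]] + row[i + 1:] for k, row in enumerate(A)]
        solution.append(Fraction(det(A_i), d))
    return solution

A = [[2, 1, 0],
     [1, 3, 1],
     [0, 1, 4]]
b = [1, 2, 3]

print(det(A))                  # 18
print(matmul(A, adjugate(A)))  # 18 times the identity matrix
x = cramer(A, b)
print(x)                       # [Fraction(1, 3), Fraction(1, 3), Fraction(2, 3)]
```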
1.36 Derivative of the determinant Viewing the individual entries of the matrix
$A$ as independent variables and the determinant as a function of those
variables, show that

\[ \frac{\partial}{\partial A_{ij}} (\det A) = \tilde{A}_{ij}, \tag{1.61} \]

where $\tilde{A}_{ij}$ is the cofactor of $A_{ij}$ in $\det A$. Show that, when $A$ is invertible,

\[ \frac{\partial}{\partial A_{ij}} (\det A) = (\det A)(A^{-1})_{ji}. \tag{1.62} \]

Hint: Specialize to the case $i = 1$, $j = 2$, say, and consider the Laplace
expansion by the first row; then generalize. You may want to use (1.59).

1.37 Nonsingular matrices and rank Let $T : V \to V$ be an automorphism and
$\mathbf{T}$ its matrix representation relative to some basis. Show that:

(a) $T$ has maximal rank;

(b) $\mathbf{T}$ is nonsingular.

1.38 Orthogonal transformations and the orthogonal group In the following
exercises, $V$ is a real vector space with a positive definite inner product $g$. An
orthogonal transformation is a linear map $R : V \to V$ that preserves the
inner product:

\[ g(Rv, Rw) = g(v, w). \]

(a) Show that $R$ maps an orthonormal basis to an orthonormal basis and
that every linear map carrying one orthonormal basis to another must be
orthogonal.

(b) Let $\{e_i\}$ be an orthonormal basis for $V$, and let $\mathbf{R}$ represent the orthogonal
transformation $R$ in this basis. The matrix $\mathbf{R}$ is orthogonal if
$\mathbf{R}^T \mathbf{R} = \mathbf{R}\mathbf{R}^T = I$. Show that the linear transformation $R$ is orthogonal if and
only if the matrix $\mathbf{R}$ is orthogonal.

(c) Let $\mathbf{R}$ be an orthogonal matrix. Show that $\det \mathbf{R} = \pm 1$. Hint: Use (2.54).
Remark: If $\det \mathbf{R} = 1$, $\mathbf{R}$ is a rotation matrix; otherwise it is a
roto-reflection matrix.

(d) The set of all orthogonal transformations on $\mathbb{R}^n$ that preserve the
Euclidean inner product is denoted $O(n)$ and called the orthogonal
group. Show that $O(n)$ is $\tfrac{1}{2}n(n - 1)$-dimensional, in the sense that it
requires this many parameters to specify an element of $O(n)$. Hint: How
many conditions must an $n \times n$ orthogonal matrix satisfy?

1.39 The determinant of the inverse is the inverse of the determinant Show
that $\det A^{-1} = (\det A)^{-1}$. Hint: Use (2.54).
1.40 Similarity transformations Let $e$ and $e'$ be two bases for $V$, with $e' = eS$
(so that $S$ is the change of basis matrix). Let $A : V \to V$ be a linear
operator, represented by $\mathbf{A}$ relative to $e$ and by $\mathbf{A}'$ relative to $e'$. Show
that $\mathbf{A}' = S^{-1} \mathbf{A} S$. We say that $\mathbf{A}$ and $\mathbf{A}'$ are related by a similarity
transformation.

1.41 Diagonalizability Let $A$ be an $n \times n$ matrix and $v$ an $n \times 1$ matrix (i.e., a
column vector or column matrix). If $Av = \lambda v$ for some scalar $\lambda$ then $v$ is an
eigenvector of $A$ with eigenvalue $\lambda$. If there exists a nonsingular matrix $S$
such that $\Lambda := S^{-1} A S$ is a diagonal matrix (so that $\Lambda_{ij} = 0$ if $i \neq j$) we
say that $A$ is diagonalizable (by a similarity transformation). Show that $A$ is
diagonalizable if and only if it has $n$ linearly independent eigenvectors. Hint:
Let the columns of $S$ be the eigenvectors of $A$.

1.42 Eigenvalues and the characteristic polynomial Show that the eigenvalues
of a matrix $A$ are the roots of its characteristic polynomial $p_A(\lambda) :=
\det(A - \lambda I)$. Remark: Although you can calculate the eigenvalues in this way
it does not mean that the matrix is diagonalizable. The story is a bit involved,
but worth recalling. The characteristic polynomial $p_A(\lambda)$ is a polynomial of
degree $n$. By the fundamental theorem of algebra, it always has $n$ (complex)
roots. If $\mu$ is a root of $p_A(\lambda)$ then $\lambda - \mu$ divides it. The algebraic
multiplicity $a_\mu$ of the eigenvalue $\mu$ is the maximum degree of this factor in the
characteristic polynomial. The geometric multiplicity $g_\mu$ is the dimension of
the eigenspace corresponding to $\mu$ (namely the number of linearly independent
eigenvectors with eigenvalue $\mu$). One always has $g_\mu \le a_\mu$. The matrix
$A$ is diagonalizable if and only if $g_\mu = a_\mu$ for all eigenvalues $\mu$; then the
eigenvectors of $A$ span the vector space.

1.43 Eigenvalues are similarity invariants If $f(A) = f(S^{-1} A S)$ for some
function $f$ we say that $f$ is a similarity invariant. Show that the eigenvalues
of $A$ are similarity invariants. Hint: Combine the result of Exercise 1.42 with
(2.54).

1.44 The trace and determinant as sums and products of eigenvalues The
trace of a matrix $A$ is the sum of its diagonal elements: $\operatorname{tr} A := \sum_i A_{ii}$.
Show that $\det A$ and $\operatorname{tr} A$ are respectively the product and the sum of the
eigenvalues of $A$. It follows from Exercise 1.43 that the determinant and the
trace are similarity invariants.

1.45 Cyclic invariance of the trace Show that the trace is invariant under cyclic
permutations. That is, show that, for any matrices $\{A_1, A_2, \dots, A_n\}$,

\[ \operatorname{tr} A_1 A_2 \cdots A_n = \operatorname{tr} A_n A_1 A_2 \cdots A_{n-1}. \tag{1.63} \]

Hint: First try the case $n = 2$.
1.46 Expansion of the determinant in terms of traces The elementary symmetric
functions are defined by

\[ e_k(x_1, \dots, x_n) := \sum_{1 \le i_1 < \cdots < i_k \le n} x_{i_1} \cdots x_{i_k}, \tag{1.64} \]

for $k \ge 1$; we also set $e_0 = 1$. For example, when $n = 3$ we have

\[ e_1(x_1, x_2, x_3) = x_1 + x_2 + x_3, \]
\[ e_2(x_1, x_2, x_3) = x_1 x_2 + x_1 x_3 + x_2 x_3, \]
\[ e_3(x_1, x_2, x_3) = x_1 x_2 x_3. \]

The power sum symmetric functions are defined by

\[ p_k(x_1, \dots, x_n) := \sum_{i=1}^{n} x_i^k, \tag{1.65} \]

for $k \ge 1$; we also set $p_0 := 1$. For example, when $n = 3$ we have

\[ p_1(x_1, x_2, x_3) = x_1 + x_2 + x_3, \]
\[ p_2(x_1, x_2, x_3) = x_1^2 + x_2^2 + x_3^2, \]
\[ p_3(x_1, x_2, x_3) = x_1^3 + x_2^3 + x_3^3. \]

It is often convenient to allow an infinite number of variables, in which case
we write

\[ e_k := \sum_{1 \le i_1 < \cdots < i_k} x_{i_1} \cdots x_{i_k} \tag{1.66} \]

and

\[ p_k := \sum_{i \ge 1} x_i^k. \tag{1.67} \]

To recover $e_k(x_1, \dots, x_n)$ and $p_k(x_1, \dots, x_n)$ from $e_k$ and $p_k$ just set
$x_{n+1} = x_{n+2} = \cdots = 0$.

(a) Define

\[ E(t) := \sum_{j=0}^{\infty} e_j t^j, \tag{1.68} \]

the generating function of the elementary symmetric functions. Show that

\[ E(t) = \prod_{j=1}^{\infty} (1 + x_j t). \tag{1.69} \]

Hint: Expand the product in powers of $t$.

(b) Define

\[ P(t) := \sum_{j=1}^{\infty} p_j t^{j-1}, \tag{1.70} \]

the generating function of the power sum symmetric functions. (Note the
index shift.) Show that

\[ P(t) = \sum_{j=1}^{\infty} \frac{x_j}{1 - x_j t}. \tag{1.71} \]

Hint: Recall the sum of a geometric series.

(c) Derive Newton’s identities, i.e., for all $k$,

\[ k e_k = \sum_{i=1}^{k} (-1)^{i-1} e_{k-i}\, p_i. \tag{1.72} \]

Hint: Show that $P(-t) = (dE(t)/dt)/E(t)$.

(d) By repeated substitutions we can write the $e$’s in terms of the $p$’s, but this
brute force method becomes rapidly too cumbersome. Instead, show that

\[ e_n = \frac{1}{n!} \begin{vmatrix} p_1 & 1 & 0 & \cdots & 0 \\ p_2 & p_1 & 2 & \cdots & 0 \\ \vdots & \vdots & \ddots & \ddots & \vdots \\ p_{n-1} & p_{n-2} & \cdots & p_1 & n - 1 \\ p_n & p_{n-1} & \cdots & p_2 & p_1 \end{vmatrix}. \tag{1.73} \]

Hint: Write out the first $k$ Newton identities as a matrix equation for the
unknowns $e_i$ and use Cramer’s rule. Remark: There is a better way to write
this formula using partitions of integers: see ([81], Chapter 7).

(e) Write down an expression for the determinant of a $4 \times 4$ matrix $A$ as a
sum of products of the traces of powers of $A$. For example, for $n = 3$ we
have

\[ \det A = \frac{1}{3!} \left[ (\operatorname{tr} A)^3 - 3 (\operatorname{tr} A)(\operatorname{tr} A^2) + 2 \operatorname{tr} A^3 \right]. \tag{1.74} \]

Hint: Assume $A$ to be diagonalizable and compute the determinant and
the traces of powers of $A$ in terms of the eigenvalues.
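The $n = 3$ trace formula, written with its normalizing factor $1/3!$, can be sanity-checked on a concrete matrix. The sketch below uses an arbitrary symmetric integer matrix (an illustrative choice, not one from the text) and exact rational arithmetic.

```python
from fractions import Fraction

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def trace(M):
    return sum(M[i][i] for i in range(len(M)))

def det3(M):
    """Determinant of a 3 x 3 matrix by expansion along the first row."""
    (a, b, c), (d, e, f), (g, h, i) = M
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)

A = [[2, 1, 0],
     [1, 3, 1],
     [0, 1, 4]]
A2 = matmul(A, A)
A3 = matmul(A2, A)

# Power sums of the eigenvalues: p_k = tr A^k.
p1, p2, p3 = trace(A), trace(A2), trace(A3)

# e_3 = (p1^3 - 3 p1 p2 + 2 p3) / 3!, which should equal det A.
det_from_traces = Fraction(p1 ** 3 - 3 * p1 * p2 + 2 * p3, 6)
print((p1, p2, p3), det_from_traces, det3(A))  # (9, 33, 135) 18 18
```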

1.47 The trace of an endomorphism The trace of an endomorphism $A$ is
defined to be the trace of any matrix that represents $A$. Show that the trace
of an endomorphism is well defined by showing that it is invariant under a
change of basis.

1.48 The trace, the determinant, and the exponential Show that, for any
diagonalizable matrix $A$,

\[ e^{\operatorname{tr} A} = \det e^A, \tag{1.75} \]

where the exponential of a matrix is defined by its Taylor expansion:

\[ e^A = 1 + A + \frac{1}{2!} A^2 + \frac{1}{3!} A^3 + \cdots. \]

Hint: Consider the eigenvalues of $A$. Ignore convergence issues, which all
work out correctly in the end. Remark: A theorem due to Schur guarantees
that any matrix $A$ can be decomposed into the sum of a diagonal matrix $D$
and a strictly upper triangular matrix $N$. For extra credit, use Schur’s theorem
to show that (1.75) holds for any matrix $A$.

1.49 The Cauchy-Schwarz inequality Assume that $g$ is a positive definite real
inner product. Prove the Cauchy-Schwarz inequality: for any two nontrivial
vectors $u$ and $v$,

\[ g(u, v)^2 \le g(u, u)\, g(v, v), \]

with equality holding if and only if $u = \lambda v$ for some scalar $\lambda$. Hint: Minimize
$g(u + av, u + av)$ with respect to $a$.

1.50 The $L^2(\mathbb{R})$ inner product Let $L^2(\mathbb{R})$ be the vector space of square
integrable functions on $\mathbb{R}$, where the vector space operations are defined
pointwise: $(f + g)(x) = f(x) + g(x)$ and $(af)(x) = af(x)$. A function
$f$ is square integrable if $\int_{-\infty}^{\infty} f^2(x)\, dx < \infty$. Show that the map
$(\cdot, \cdot) : L^2(\mathbb{R}) \times L^2(\mathbb{R}) \to \mathbb{R}$ defined by

\[ (f, g) := \int_{-\infty}^{\infty} f(x) g(x)\, dx \]

is a positive definite inner product.

1.51 Linear independence and the Grammian Let $g$ be an inner product on
a real vector space $V$ with $\dim V \ge n$. Show that $v_1, \dots, v_n$ are linearly
independent if and only if the Grammian $\det(g(v_i, v_j))$ is nonvanishing. Hint:
You may find it easier to prove the negation instead, that is, to show that
$v_1, \dots, v_n$ are linearly dependent if and only if the Grammian vanishes. To
that end, start with $\sum_i c_i v_i = 0$ and analyze the system of equations you get
by taking the inner product of this equation with all the $v_j$.
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1.52 Gram-Schmidt orthonormalization Following the ideas given in the first 
proof of Theorem 1.5 we can deduce an algorithm, called Gram-Schmidt
orthonormalization, for constructing an orthonormal basis from a linearly
independent set of vectors. The algorithm is easiest to explain if we assume
$\mathbb{F} = \mathbb{R}$ and a positive definite inner product, but can be made to work in
general with suitable modifications.

Let $\{v_1, \dots, v_n\}$ be linearly independent, where $n = \dim V$. The first step
is to set

$$e_1 := \frac{v_1}{\sqrt{g(v_1, v_1)}}.$$

Next, project $v_2$ onto the orthogonal complement of $e_1$ and then normalize:

$$e_2' := v_2 - e_1\, g(e_1, v_2), \qquad e_2 := \frac{e_2'}{\sqrt{g(e_2', e_2')}}.$$

Now, project $v_3$ onto the orthogonal complement of $e_1$ and $e_2$ and normalize:

$$e_3' := v_3 - e_1\, g(e_1, v_3) - e_2\, g(e_2, v_3), \qquad e_3 := \frac{e_3'}{\sqrt{g(e_3', e_3')}}.$$

Continuing in this way, we end up with an orthonormal basis $\{e_1, \dots, e_n\}$.
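
The steps above translate almost line by line into code. The following is a minimal sketch, assuming real vectors stored as lists of components and a positive definite inner product; the function names `dot` and `gram_schmidt` and the sample vectors are choices made for this illustration only. It uses the ordinary dot product on $\mathbb{R}^3$ rather than the polynomial inner product of the exercise.

```python
import math

def dot(u, v):
    """The usual Euclidean inner product, standing in for g."""
    return sum(a * b for a, b in zip(u, v))

def gram_schmidt(vectors, g=dot):
    """Orthonormalize a linearly independent list of vectors with respect to g."""
    basis = []
    for v in vectors:
        w = list(v)
        # Subtract the projections e * g(e, v) onto the vectors found so far.
        for e in basis:
            c = g(e, v)
            w = [wi - c * ei for wi, ei in zip(w, e)]
        # Normalize what is left.
        norm = math.sqrt(g(w, w))
        basis.append([wi / norm for wi in w])
    return basis

vs = [[1.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 1.0]]
es = gram_schmidt(vs)
for e in es:
    print(e)
```

Passing a different function for `g` changes the inner product without touching the algorithm, provided that `g` is positive definite so that the square root makes sense.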

Let $V$ be the four-dimensional vector space consisting of the polynomials
of degree at most 3 equipped with the inner product

$$g(p, q) := \int_{-1}^{1} p(x) q(x)\, dx.$$

Using Gram-Schmidt orthonormalization starting from the linearly independent
set $\{1, x, x^2, x^3\}$ (cf. Exercise 1.28), construct an orthonormal
basis of polynomials for $V$. Remark: The resulting basis functions are the
(normalized) Legendre polynomials.

1.53 Dual maps and annihilators Let $T : V \to W$ be a linear map with dual
map $T^*$. Show that $\ker T^* = \operatorname{Ann} \operatorname{im} T$ and $\operatorname{im} T^* = \operatorname{Ann} \ker T$.

Warning: $\ker T^* \ne (\ker T)^*$ and $\operatorname{im} T^* \ne (\operatorname{im} T)^*$. Hint: The first assertion
is easy, but the second is not. For the second, let $f \in \operatorname{Ann} \ker T$ and consider
the function

$$g(w) = \begin{cases} f(S(w)) & \text{if } w \in \operatorname{im} T, \text{ and} \\ 0 & \text{otherwise,} \end{cases}$$

where $S$ is a section of $T$. Recall Theorem 1.3.

1.54 Riesz’s lemma revisited Riesz’s lemma admits a more direct proof. For
simplicity let us assume that the field is real. As previously observed the map
$v \mapsto f_v$ is then linear, so we need only show it is invertible. Let $\{e_i\}$ be an
orthonormal basis of $V$. Let $f \in V^*$, and suppose that $f(e_i) = f_i$. Define

$$v_f := \sum_i g(e_i, e_i)\, f_i\, e_i.$$

Show that the map $f \mapsto v_f$ is the desired inverse.

Come, let us hasten to a higher plane
Where dyads tread the fairy fields of Venn, 

Their indices bedecked from one to n 
Commingled in an endless Markov chain! 

Stanislaw Lem*

Multilinear algebra is basically just linear algebra with many vector spaces at the 
same time. The fundamental objects are generalizations of vectors called tensors. 
Tensors can be viewed from many different perspectives. Mathematicians intro- 
duce tensors formally as a quotient of a certain module, while physicists introduce 
tensors using objects with many indices that transform in a specific way under a 
change of basis. Each definition is viewed disparagingly by the other camp, even 
though they are equivalent and both have their uses. We follow a middle approach 
here. 


2.1 The tensor product 

To start, we define a new kind of vector product, called the tensor product, usually
denoted by the symbol $\otimes$. Given two vectors $v$ and $w$, we can form their tensor
product $v \otimes w$. The product $v \otimes w$ is called a tensor of order 2 or a second-order
tensor or a 2-tensor. The order of the factors matters, as $w \otimes v$ is generally
different from $v \otimes w$. In fancy language, the tensor product is noncommutative.
We can form higher-order tensors by repeating this procedure. For example, given
another vector $u$ we can construct $u \otimes v \otimes w$, a third-order tensor. (The tensor
product is associative, so we need not worry about parentheses.) Order-0 tensors
are just scalars, while order-1 tensors are vectors.

* Excerpt from “Love and tensor algebra”, from The Cyberiad: Fables For The Cybernetic Age by Stanislaw
Lem, translated by Michael Kandel. English translation copyright ©1974 by Houghton Mifflin Harcourt
Publishing Company. Reprinted by permission of Houghton Mifflin Harcourt Publishing Company. All rights
reserved.

In older books the tensor $v \otimes w$ is sometimes called a dyadic product (of the
vectors $v$ and $w$) and is written $vw$. That is, the tensor product symbol $\otimes$ is
simply dropped. Even in modern works the tensor product symbol is often omitted.
This generally leads to no confusion, as the only way to understand the proximate
juxtaposition of two vectors is as a tensor product.

The set $T^r$ of tensors of order $r$ forms a vector space in a natural way; if $S$ and
$T$ are both tensors of order $r$ then $aT + bS$ is again a tensor of order $r$. One writes

$$T^r = \underbrace{V \otimes V \otimes \cdots \otimes V}_{r \text{ times}} = V^{\otimes r}. \tag{2.1}$$

The set $T = \bigcup_r T^r$ of all tensors forms a mathematical object called an algebra,
which basically means that addition and the tensor product are compatible in the
usual way. Specifically, if $R$ is a tensor of order $r$ and $S$ is a tensor of order $s$ then
$R \otimes S$ is a tensor of order $r + s$.¹ In addition, scalars “pull through” tensor products,

$$T \otimes (aS) = (aT) \otimes S = a(T \otimes S), \tag{2.2}$$

and tensor products are distributive over addition:

$$R \otimes (S + T) = R \otimes S + R \otimes T, \tag{2.3}$$

$$(S + T) \otimes R = S \otimes R + T \otimes R. \tag{2.4}$$

Just as a vector has components in some basis, so does a tensor. Let $e_1, e_2, e_3$ be
the canonical basis of $\mathbb{R}^3$. Then the canonical basis for the vector space $\mathbb{R}^3 \otimes \mathbb{R}^3$ of
order-2 tensors on $\mathbb{R}^3$ is given by the set $e_i \otimes e_j$ as $i$ and $j$ run from 1 to 3. Written
out in full, these basis elements are

$$\begin{array}{lll}
e_1 \otimes e_1, & e_1 \otimes e_2, & e_1 \otimes e_3, \\
e_2 \otimes e_1, & e_2 \otimes e_2, & e_2 \otimes e_3, \\
e_3 \otimes e_1, & e_3 \otimes e_2, & e_3 \otimes e_3.
\end{array} \tag{2.5}$$

The most general second-order tensor on $\mathbb{R}^3$ is a linear combination of these
basis tensors:

$$T = \sum_{ij} T^{ij}\, e_i \otimes e_j. \tag{2.6}$$

Almost always the basis is understood and fixed throughout. For this reason,
physicists and engineers often identify tensors with their components. So, for example,
$T^{ij}$ is often called a tensor, when really it just gives the components of some tensor
in some basis. Under certain circumstances this transgression can be convenient.


1 Technically, this makes $T$ into a graded algebra, just like the algebra of polynomials.

But it can also be dangerous because, to be the components of a tensor, the quantities
$T^{ij}$ must behave in a very specific way under a change of basis. We shall return
to this issue in Section 2.3.

As a simple illustration of tensor products and components, let us find the
components of the tensor $v \otimes w$. We have

$$v \otimes w = \Bigl(\sum_i v^i e_i\Bigr) \otimes \Bigl(\sum_j w^j e_j\Bigr) = \sum_{ij} v^i w^j\, (e_i \otimes e_j), \tag{2.7}$$

so the components of $v \otimes w$ are just $v^i w^j$. The same idea works in general. For
example, the components of $u \otimes v \otimes w$ are just $u^i v^j w^k$, and so on.

It is perhaps worth observing that a tensor of the form $v \otimes w$ for some vectors
$v$ and $w$ is not the most general order-2 tensor. The reason is that the most general
order-2 tensor has nine algebraically independent components, whereas $v^i w^j$
has only six algebraically independent components (three from each vector). For
example, $v^2 w^1 = (v^1 w^1)(v^2 w^2)/(v^1 w^2)$.
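
The component formula is easy to experiment with. Here is a small sketch, assuming vectors in $\mathbb{R}^3$ stored as lists and a hypothetical helper `tensor_product` that returns the array of components $v^i w^j$ (indices are 0-based in the code, so `vw[1][0]` is the component written $v^2 w^1$ in the text).

```python
def tensor_product(v, w):
    """Components (v ⊗ w)^{ij} = v^i w^j, stored as a list of rows."""
    return [[vi * wj for wj in w] for vi in v]

v = [1, 2, 3]
w = [4, 5, 6]
vw = tensor_product(v, w)
wv = tensor_product(w, v)

# The order of the factors matters: v ⊗ w and w ⊗ v have different components.
noncommutative = vw != wv

# The nine components of v ⊗ w are not independent. The relation quoted in the
# text, cleared of denominators, reads v^2 w^1 * v^1 w^2 = v^1 w^1 * v^2 w^2.
relation_holds = vw[1][0] * vw[0][1] == vw[0][0] * vw[1][1]

print(vw, noncommutative, relation_holds)
```

A generic $3 \times 3$ array of components satisfies no such relation, which is one way to see that most order-2 tensors are not of the form $v \otimes w$.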


Example 2.1 Some examples of tensors you may have already encountered include
the inertia tensor and the electromagnetic field strength tensor. Given a rigid body
consisting of a bunch of point masses $m_a$ at positions $\mathbf{r}_a = (x_{a,1}, x_{a,2}, x_{a,3})$, its
inertia tensor is

$$I_{ij} = \sum_a m_a\, (r_a^2\, \delta_{ij} - x_{a,i}\, x_{a,j}),$$

where $r_a^2 = \mathbf{r}_a \cdot \mathbf{r}_a$ and $\cdot$ is the usual Euclidean dot product. Of course, this is typical
physics sloppiness: $I_{ij}$ is not a tensor, it is the component of a tensor in the standard
basis $\{e_1, e_2, e_3\}$ of $\mathbb{R}^3$. Moreover, the indices are in the wrong place. If we were
being really pedantic we would write

$$I^{ij} = \sum_a m_a\, (r_a^2\, \delta^{ij} - x_a^i x_a^j),$$

in which case the tensor itself would be $I = \sum_{ij} I^{ij}\, e_i \otimes e_j$. The components of the
inertia tensor of a continuous body are given by

$$I^{ij} = \int \rho(\mathbf{r})\, (r^2 \delta^{ij} - x^i x^j)\, d^3x,$$


where $\rho(\mathbf{r})$ is the density at the point $\mathbf{r}$. (Here the integrand is something called a
“tensor density”, which we discuss in Exercise 8.52.) Similarly, if $A_\mu$ denotes the
electromagnetic vector potential, the field strength tensor is

$$F_{\mu\nu} = \partial_\mu A_\nu - \partial_\nu A_\mu,$$

where $\partial_\mu := \partial/\partial x^\mu$. Once again this is sloppy nomenclature because the $F_{\mu\nu}$ are the
components of the field strength tensor $F$ in some basis. Exactly what basis is being
used here will have to wait until we reach Example 3.14.

2 In some books $v \otimes w$ is called a tensor of rank 2. The problem with this terminology is that it conflicts with
standard usage in matrix theory. If we consider the components $v^i w^j$ of the tensor $v \otimes w$ in some basis to be
the components of a matrix, then this matrix only has rank 1. (The rows are all multiples of each other.) To
avoid this problem, one usually says that a tensor of the form $v \otimes w \otimes \cdots$ has rank 1. Any tensor is a sum
of rank-1 tensors, and we say that the rank of the tensor is the minimum number of rank-1 tensors needed to
write it as such a sum. Finding methods to compute the rank of a tensor efficiently is a difficult open problem.


2.2 General tensors 

It turns out that the previous definition of a tensor is a bit too restrictive for many
purposes and must be expanded to include dual objects. Let $V$ be a vector space
and $V^*$ its dual space. Then a tensor $T$ of type $(r, s)$ is an element of the tensor
product space

$$T^r_s = \underbrace{V \otimes V \otimes \cdots \otimes V}_{r \text{ times}} \otimes \underbrace{V^* \otimes V^* \otimes \cdots \otimes V^*}_{s \text{ times}} = V^{\otimes r} \otimes (V^*)^{\otimes s}. \tag{2.8}$$

What we previously called a tensor of order $r$ is just a tensor of type $(r, 0)$.
The properties of the tensor product ensure that the space of all tensors forms a
multigraded algebra, as before.³

Pick a basis $\{e_i\}$ for $V$ and a dual basis $\{\theta^i\}$ for $V^*$. Then a basis for the space
$T^r_s$ comprises objects of the form

$$e_{i_1} \otimes e_{i_2} \otimes \cdots \otimes e_{i_r} \otimes \theta^{j_1} \otimes \theta^{j_2} \otimes \cdots \otimes \theta^{j_s}, \tag{2.9}$$

where all indices run from 1 to $\dim V$. A general tensor $T$ of type $(r, s)$ is a linear
combination of the basis elements:

$$T = \sum_{i_1, \dots, i_r} \sum_{j_1, \dots, j_s} T^{i_1 i_2 \cdots i_r}{}_{j_1 j_2 \cdots j_s}\; e_{i_1} \otimes e_{i_2} \otimes \cdots \otimes e_{i_r} \otimes \theta^{j_1} \otimes \theta^{j_2} \otimes \cdots \otimes \theta^{j_s}. \tag{2.10}$$

As before the elements $T^{i_1 i_2 \cdots i_r}{}_{j_1 j_2 \cdots j_s}$ are called the components of $T$. In order to
avoid having to write out the summation symbols, Einstein introduced his summation
convention: whenever two indices with the same labels appear on the same
side of an equation, one assumes that there is an implicit sum over those indices.
This allows us to rewrite (2.10) as

$$T = T^{i_1 i_2 \cdots i_r}{}_{j_1 j_2 \cdots j_s}\; e_{i_1} \otimes e_{i_2} \otimes \cdots \otimes e_{i_r} \otimes \theta^{j_1} \otimes \theta^{j_2} \otimes \cdots \otimes \theta^{j_s}. \tag{2.11}$$


3 The term “multigraded” in this context means that if $S$ is a tensor of type $(r, s)$ and $T$ is a tensor of type $(p, q)$
then $S \otimes T$ is a tensor of type $(r + p, s + q)$.

This often saves much writing, but it does require that one keep careful
track of all the indices. We shall adopt this convention later in the book, but for
now we leave the summation symbols in place.

2.3 Change of basis 

It follows immediately from our discussion and the results of Section 1.7 that
the components of a tensor transform under a change of basis as one would
expect from the placement of their indices. Upstairs indices transform contravariantly,
whereas downstairs indices transform covariantly. Before we can write down
the actual transformation law, though, we need to introduce another notational
convention.

In Section 1.7 we wrote $\{v'^i\}$ to denote the components of the vector $v$ relative
to the new basis $\{e'_i\}$, but this notation can be problematic in the context of tensors,
because tensors have more than one index. Each index goes with a different basis
element, so it is convenient to put the primes on the indices rather than on the
tensor label itself. Thus, for example, (1.18) would be written $v^{i'} = \sum_i (A^{-1})^{i'}{}_{i}\, v^i$.
According to this convention, the transformation law for tensor components under
a change of basis is written

$$T^{i'_1 \cdots i'_r}{}_{j'_1 \cdots j'_s} = \sum_{i_1, \dots, i_r} \sum_{j_1, \dots, j_s} T^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}\, (A^{-1})^{i'_1}{}_{i_1} \cdots (A^{-1})^{i'_r}{}_{i_r}\, A^{j_1}{}_{j'_1} \cdots A^{j_s}{}_{j'_s}. \tag{2.12}$$

This notation has advantages and disadvantages. The primary advantage is that,
by putting the primes on the indices rather than on the tensor symbol itself, it suggests
that we are dealing with the same tensor but in different coordinate systems.
The primary disadvantage is that it is not clear to which components we are referring,
new or old, when writing something like $T^{12}{}_{34}$, say. So, in this notation,
one must really write $T^{1'2'}{}_{3'4'}$ to refer to a particular component of $T$ in the new
coordinate system.
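
To see the transformation law at work, it helps to check that a basis-independent number really is unchanged. The sketch below assumes a two-dimensional space, a type $(0, 2)$ tensor with components $T_{ij}$, and a change of basis $e_{j'} = \sum_i e_i A^i{}_{j'}$; all function names and sample values are illustrative assumptions. Vector components transform with $A^{-1}$, the tensor components with two factors of $A$, and the number $\sum_{ij} T_{ij} v^i w^j$ should come out the same in both bases.

```python
from fractions import Fraction

def inverse_2x2(a):
    """Inverse of an invertible 2 x 2 matrix."""
    det = a[0][0] * a[1][1] - a[0][1] * a[1][0]
    return [[a[1][1] / det, -a[0][1] / det],
            [-a[1][0] / det, a[0][0] / det]]

def transform_vector(a_inv, v):
    """Contravariant law: v^{i'} = sum_i (A^{-1})^{i'}_i v^i."""
    return [sum(a_inv[ip][i] * v[i] for i in range(2)) for ip in range(2)]

def transform_02_tensor(a, t):
    """Covariant law: T_{i'j'} = sum_{ij} T_{ij} A^i_{i'} A^j_{j'}."""
    return [[sum(t[i][j] * a[i][ip] * a[j][jp]
                 for i in range(2) for j in range(2))
             for jp in range(2)] for ip in range(2)]

def evaluate(t, v, w):
    """The number sum_{ij} T_{ij} v^i w^j."""
    return sum(t[i][j] * v[i] * w[j] for i in range(2) for j in range(2))

A = [[Fraction(2), Fraction(1)], [Fraction(1), Fraction(1)]]
A_inv = inverse_2x2(A)
T = [[Fraction(1), Fraction(2)], [Fraction(3), Fraction(4)]]
v = [Fraction(1), Fraction(-1)]
w = [Fraction(2), Fraction(5)]

old_value = evaluate(T, v, w)
new_value = evaluate(transform_02_tensor(A, T),
                     transform_vector(A_inv, v),
                     transform_vector(A_inv, w))
print(old_value, new_value)
```

Exact rational arithmetic is used so that the two values can be compared without worrying about rounding.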


2.4 Tensors as multilinear maps 

There is yet another way of looking at tensors that is very handy in practice. It
exploits the isomorphism between a vector space and its dual. The map

$$T : \underbrace{V^* \times \cdots \times V^*}_{r \text{ times}} \times \underbrace{V \times \cdots \times V}_{s \text{ times}} \to \mathbb{R} \tag{2.13}$$

is said to be multilinear if it is linear in each entry:

$$T(v_1, \dots, au + bw, \dots, v_{r+s}) = a\,T(v_1, \dots, u, \dots, v_{r+s}) + b\,T(v_1, \dots, w, \dots, v_{r+s}), \tag{2.14}$$

where $v_1, \dots, v_{r+s}$, $u$, and $w$ are elements of $V$ or $V^*$ as appropriate, and $a$ and $b$
are scalars. The space of all such maps is linear under pointwise addition and scalar
multiplication. Let us denote it by $\widetilde{T}^r_s$.

By employing the natural pairing $\langle \cdot, \cdot \rangle$ (see Section 1.6) between $V$ and $V^*$, we
may view the tensor

$$e_{i_1} \otimes \cdots \otimes e_{i_r} \otimes \theta^{j_1} \otimes \cdots \otimes \theta^{j_s} \tag{2.15}$$

as a multilinear map on the Cartesian product space $(V^*)^{\times r} \times V^{\times s}$ that acts
according to the rule

$$\begin{aligned}
(e_{i_1} \otimes \cdots \otimes e_{i_r} \otimes \theta^{j_1} \otimes \cdots \otimes \theta^{j_s})&(\theta^{k_1}, \dots, \theta^{k_r}, e_{\ell_1}, \dots, e_{\ell_s}) \\
&= \langle e_{i_1}, \theta^{k_1} \rangle \cdots \langle e_{i_r}, \theta^{k_r} \rangle \langle \theta^{j_1}, e_{\ell_1} \rangle \cdots \langle \theta^{j_s}, e_{\ell_s} \rangle \\
&= \delta^{k_1}_{i_1} \cdots \delta^{k_r}_{i_r}\, \delta^{j_1}_{\ell_1} \cdots \delta^{j_s}_{\ell_s}.
\end{aligned} \tag{2.16}$$

In other words, the tensor product symbol acts like a placeholder that just tells you
which covectors or vectors should be paired. Viewing the tensor product in this
way as a multilinear map, we have

$$\begin{aligned}
T(\theta^{k_1}, \dots, \theta^{k_r}, e_{\ell_1}, \dots, e_{\ell_s})
&= \sum_{i_1, \dots, i_r} \sum_{j_1, \dots, j_s} T^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}\,
(e_{i_1} \otimes \cdots \otimes e_{i_r} \otimes \theta^{j_1} \otimes \cdots \otimes \theta^{j_s})(\theta^{k_1}, \dots, \theta^{k_r}, e_{\ell_1}, \dots, e_{\ell_s}) \\
&= T^{k_1 \cdots k_r}{}_{\ell_1 \cdots \ell_s}.
\end{aligned} \tag{2.17}$$

This establishes an isomorphism from $T^r_s$ to $\widetilde{T}^r_s$. Essentially, one can view a tensor
either passively as an element of a certain vector space (the tensor product space)
or actively as a multilinear functional on the dual of that vector space. They are
two sides of the same coin. Therefore we may interchange multilinear maps and
tensors as we please.
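
For a type $(1, 1)$ tensor the two viewpoints can be compared directly. The sketch below stores the components $T^i{}_j$ as a list of rows and treats the tensor as a function of one covector and one vector, both given by their components; the helper names `evaluate` and `basis` are assumptions made for this illustration.

```python
def evaluate(T, covector, vector):
    """T(theta, v) = sum_{ij} T^i_j theta_i v^j for a type (1, 1) tensor."""
    n = len(T)
    return sum(T[i][j] * covector[i] * vector[j]
               for i in range(n) for j in range(n))

def basis(n, k):
    """Components of the k-th basis (co)vector in dimension n (0-based)."""
    return [1 if m == k else 0 for m in range(n)]

T = [[1, 2, 3], [4, 5, 6], [7, 8, 9]]

# Feeding in basis covectors and basis vectors returns the components, as in (2.17).
recovered = [[evaluate(T, basis(3, k), basis(3, l)) for l in range(3)]
             for k in range(3)]

# Linearity in the vector slot, as in (2.14).
theta, u, w, a, b = [1, -1, 2], [2, 0, 1], [0, 3, -1], 5, -2
combo = [a * ui + b * wi for ui, wi in zip(u, w)]
linear = evaluate(T, theta, combo) == a * evaluate(T, theta, u) + b * evaluate(T, theta, w)

print(recovered == T, linear)
```

The same pattern extends to type $(r, s)$ tensors by adding one loop index per slot.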


2.5 Symmetry types of tensors 

Let $T$ be a tensor of type $(0, 2)$ with components $T_{ij}$ in some basis. If $T_{ij} = T_{ji}$ we
say $T$ is symmetric, while if $T_{ij} = -T_{ji}$ we say $T$ is antisymmetric. Of course, a
general $(0, 2)$ tensor has no such property. But if a $(0, 2)$ tensor has either property,
we say it has definite symmetry.

EXERCISE 2.1 As we have defined it, the condition of having definite symmetry
would appear to be basis dependent. Using the tensor component transformation
law (2.12) show that this is not the case. That is, if $T$ is symmetric (respectively,
antisymmetric) for one choice of basis then it is symmetric (respectively, antisymmetric)
for all choices of basis.

EXERCISE 2.2 Let $A_{ij}$ be symmetric and $B^{ij}$ be antisymmetric. Show that
$\sum_{ij} A_{ij} B^{ij} = 0$.


The symmetric part $T_{\rm sym}$ of $T$ is the symmetric tensor with components

$$(T_{\rm sym})_{ij} := \tfrac{1}{2}\,(T_{ij} + T_{ji}), \tag{2.18}$$

while the antisymmetric part $T_{\rm asym}$ of $T$ is the antisymmetric tensor with
components

$$(T_{\rm asym})_{ij} := \tfrac{1}{2}\,(T_{ij} - T_{ji}). \tag{2.19}$$

Evidently, for a $(0, 2)$ tensor, $T$ is the sum of its symmetric and antisymmetric
parts.
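
Definitions (2.18) and (2.19) are easy to try out on a concrete array of components. The sketch below is only an illustration: the function names and the sample components are assumptions, and the code ignores the distinction between upstairs and downstairs indices. It splits a $3 \times 3$ array into its two parts, confirms that they add up to the original, and evaluates the contraction of the symmetric part with the antisymmetric part, which is the kind of sum that appears in Exercise 2.2.

```python
from fractions import Fraction

def sym_part(T):
    """(T_sym)_{ij} = (T_{ij} + T_{ji}) / 2."""
    n = len(T)
    return [[Fraction(T[i][j] + T[j][i], 2) for j in range(n)] for i in range(n)]

def asym_part(T):
    """(T_asym)_{ij} = (T_{ij} - T_{ji}) / 2."""
    n = len(T)
    return [[Fraction(T[i][j] - T[j][i], 2) for j in range(n)] for i in range(n)]

T = [[1, 2, 3], [4, 5, 6], [7, 8, 10]]
S, A = sym_part(T), asym_part(T)

recombined = [[S[i][j] + A[i][j] for j in range(3)] for i in range(3)]
contraction = sum(S[i][j] * A[i][j] for i in range(3) for j in range(3))
print(recombined == T, contraction)
```

A single numerical instance is of course not a proof of Exercise 2.2, but it does show what the claim looks like in practice.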

These ideas can be generalized to higher-order tensors, but the results are not 
as simple. Defining what is meant by “definite symmetry” for higher-order tensors 
requires some understanding of the representation theory of the symmetric group, 
which is beyond the scope of this work. Fortunately, for most purposes we need 
only consider the higher-order analogues of symmetric and antisymmetric tensors, 
and these are easy enough to describe using basic ideas of permutation theory, as 
follows. 

Let $T$ be a tensor of type $(0, p)$ with components $T_{i_1 i_2 \cdots i_p}$. Define the symmetric
and antisymmetric parts of $T$ by

$$(T_{\rm sym})_{i_1 i_2 \cdots i_p} := \frac{1}{p!} \sum_{\sigma \in \mathfrak{S}_p} T_{i_{\sigma(1)} i_{\sigma(2)} \cdots i_{\sigma(p)}} =: T_{(i_1 i_2 \cdots i_p)} \tag{2.20}$$

and

$$(T_{\rm asym})_{i_1 i_2 \cdots i_p} := \frac{1}{p!} \sum_{\sigma \in \mathfrak{S}_p} (-1)^{\sigma}\, T_{i_{\sigma(1)} i_{\sigma(2)} \cdots i_{\sigma(p)}} =: T_{[i_1 i_2 \cdots i_p]}, \tag{2.21}$$

where $\mathfrak{S}_p$ is the set of all permutations of $p$ elements and $(-1)^{\sigma}$ denotes the
sign of the permutation $\sigma$.⁴ The rightmost expressions give a convenient shorthand
notation. Observe that if $p > 2$ then it is no longer true that $T$ is the sum of
its symmetric and antisymmetric parts.

4 See Appendix A.

A $(0, p)$ tensor is (totally) symmetric (or simply, symmetric), if

$$T_{i_{\sigma(1)} i_{\sigma(2)} \cdots i_{\sigma(p)}} = T_{i_1 i_2 \cdots i_p} \quad \text{for every } \sigma \in \mathfrak{S}_p. \tag{2.22}$$

It is (totally) antisymmetric (or skew symmetric, or alternating) if

$$T_{i_{\sigma(1)} i_{\sigma(2)} \cdots i_{\sigma(p)}} = (-1)^{\sigma}\, T_{i_1 i_2 \cdots i_p} \quad \text{for every } \sigma \in \mathfrak{S}_p. \tag{2.23}$$

The notions of symmetric and antisymmetric tensors extend mutatis mutandis to
tensors of type $(p, 0)$ on just raising all the indices. Moreover, symmetry type is
preserved under the taking of linear combinations of tensors of the same symmetry
type; thus, the set of all symmetric and the set of all antisymmetric tensors are both
subspaces of the space of all tensors. Specifically, we write $\operatorname{Sym}^p V$ (respectively,
$\operatorname{Alt}^p V$) for the subspace of symmetric (respectively, antisymmetric) $(0, p)$ tensors
or $(p, 0)$ tensors.
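
The sums over permutations in (2.20) and (2.21) can be carried out by brute force for small $p$. The following sketch assumes $p = 3$ in three dimensions, stores components in a dictionary keyed by index tuples (0-based), and takes as its sample tensor the product $u \otimes v \otimes w$ of three arbitrarily chosen vectors; the function names `sign` and `part` are hypothetical.

```python
from fractions import Fraction
from itertools import permutations, product
from math import factorial

def sign(perm):
    """Sign of a permutation of 0..p-1, found by counting inversions."""
    inversions = sum(1 for a in range(len(perm))
                     for b in range(a + 1, len(perm)) if perm[a] > perm[b])
    return -1 if inversions % 2 else 1

def part(T, n, p, antisymmetric):
    """Components of T_asym (if antisymmetric) or T_sym, as in (2.21) or (2.20)."""
    result = {}
    for idx in product(range(n), repeat=p):
        total = Fraction(0)
        for perm in permutations(range(p)):
            weight = sign(perm) if antisymmetric else 1
            total += weight * T[tuple(idx[k] for k in perm)]
        result[idx] = total / factorial(p)
    return result

n, p = 3, 3
u, v, w = [1, 0, 0], [1, 1, 0], [1, 1, 1]
T = {idx: Fraction(u[idx[0]] * v[idx[1]] * w[idx[2]])
     for idx in product(range(n), repeat=p)}

T_sym = part(T, n, p, antisymmetric=False)
T_asym = part(T, n, p, antisymmetric=True)

print(T[(0, 1, 2)], T_sym[(0, 1, 2)], T_asym[(0, 1, 2)])
```

For this sample the component with indices $(0, 1, 2)$ of $T$ differs from the sum of the corresponding components of the two parts, illustrating the remark that the decomposition fails for $p > 2$.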


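EXERCISE 2.3 Show that, as the names imply, if $T$ is a general $(0, p)$ tensor then
$T_{\rm sym}$ is totally symmetric and $T_{\rm asym}$ is totally antisymmetric. Extended hint: To get
you started, we prove the antisymmetric case here and leave the symmetric case to
you. We have the following chain of equalities:

\begin{align}
(T_{\rm asym})_{i_{\tau(1)} \cdots i_{\tau(p)}}
&= \frac{1}{p!} \sum_{\sigma \in \mathfrak{S}_p} (-1)^{\sigma}\, T_{i_{\sigma(\tau(1))} i_{\sigma(\tau(2))} \cdots i_{\sigma(\tau(p))}} \tag{2.24} \\
&= \frac{1}{p!} \sum_{\sigma \in \mathfrak{S}_p} (-1)^{\sigma}\, T_{i_{\sigma\tau(1)} i_{\sigma\tau(2)} \cdots i_{\sigma\tau(p)}} \tag{2.25} \\
&= \frac{1}{p!} \sum_{\pi\tau^{-1} \in \mathfrak{S}_p} (-1)^{\pi\tau^{-1}}\, T_{i_{\pi(1)} i_{\pi(2)} \cdots i_{\pi(p)}} \tag{2.26} \\
&= (-1)^{\tau}\, \frac{1}{p!} \sum_{\pi\tau^{-1} \in \mathfrak{S}_p} (-1)^{\pi}\, T_{i_{\pi(1)} i_{\pi(2)} \cdots i_{\pi(p)}} \tag{2.27} \\
&= (-1)^{\tau}\, \frac{1}{p!} \sum_{\pi \in \mathfrak{S}_p} (-1)^{\pi}\, T_{i_{\pi(1)} i_{\pi(2)} \cdots i_{\pi(p)}} \tag{2.28} \\
&= (-1)^{\tau}\, (T_{\rm asym})_{i_1 i_2 \cdots i_p}. \tag{2.29}
\end{align}

These equalities hold for the following reasons: (2.24) and (2.25) hold by definition;
in (2.26) we have set $\pi = \sigma\tau$; (2.27) holds because the sign function is a group
homomorphism from $\mathfrak{S}_p$ to $\pm 1$, so that $(-1)^{\pi\tau^{-1}} = (-1)^{\pi} (-1)^{\tau^{-1}}$, and because
a permutation has the same sign as its inverse, since $1 = (-1)^{\rm id} = (-1)^{\tau\tau^{-1}} =
(-1)^{\tau} (-1)^{\tau^{-1}}$; (2.28) holds because $\pi\tau^{-1}$ runs over all permutations as $\pi$ runs over
all permutations; lastly, (2.29) is true by definition.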


Remark We have introduced the idea of symmetry from the point of view of
tensor components, as this seems to be the most elementary way to do it. But, as
discussed in Section 2.4, we can view all this from the perspective of multilinear
maps. Viewed as a multilinear map, the condition that $T$ be symmetric is just

$$T(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(p)}) = T(v_1, v_2, \dots, v_p), \tag{2.30}$$

while the condition that $T$ be antisymmetric is

$$T(v_{\sigma(1)}, v_{\sigma(2)}, \dots, v_{\sigma(p)}) = (-1)^{\sigma}\, T(v_1, v_2, \dots, v_p), \tag{2.31}$$

for any collection of $p$ vectors $\{v_1, v_2, \dots, v_p\}$ and any permutation $\sigma \in \mathfrak{S}_p$. From
this perspective, the basis independence of the symmetry conditions is obvious.


2.6 Alternating tensors and the space $\bigwedge^p V$ of p-vectors

In this section we restrict our attention to alternating tensors, for which one can 
construct a very powerful and elegant stand-alone theory that leads eventually to 
the concept of a differential form. The theory is made possible by the use of a 
convenient shorthand notation for an alternating tensor product. 

Given two vectors $v, w \in V$, define their wedge or exterior product by⁵

$$v \wedge w = v \otimes w - w \otimes v. \tag{2.32}$$

The wedge product of two vectors is called a 2-vector, and the vector space generated
by the set of all 2-vectors is denoted $\bigwedge^2 V$. Observe that, for any vector $v$,
$v \wedge v = 0$.

We claim that $\bigwedge^2 V$ is naturally isomorphic to the vector space $\operatorname{Alt}^2 V$ of alternating
$(2, 0)$ tensors. First note that $\bigwedge^2 V$ is spanned by all 2-vectors of the form
$e_i \wedge e_j$ where $i < j$. To see this, choose a basis $\{e_1, e_2, \dots, e_n\}$ for $V$. If

$$v = \sum_i v^i e_i \quad \text{and} \quad w = \sum_j w^j e_j,$$

then, by the linearity of the tensor product,

$$v \wedge w = \sum_{ij} (v^i e_i \otimes w^j e_j - w^j e_j \otimes v^i e_i) = \sum_{ij} v^i w^j\, e_i \wedge e_j.$$

By a swap of dummy indices and by the antisymmetric nature of the wedge
product, we have

$$\sum_{ij} v^i w^j\, e_i \wedge e_j = \sum_{ji} v^j w^i\, e_j \wedge e_i = -\sum_{ij} v^j w^i\, e_i \wedge e_j.$$


5 Unfortunately, there are competing conventions here. Many authors define $v \wedge w$ as $(v \otimes w - w \otimes v)/2$. See
the discussion in footnote 8.

Therefore we can write (again using the antisymmetry property of the wedge
product)

$$v \wedge w = \frac{1}{2} \sum_{ij} (v^i w^j - v^j w^i)\, e_i \wedge e_j = \sum_{i<j} (v^i w^j - v^j w^i)\, e_i \wedge e_j.$$

It follows that any linear combination of 2-vectors can be written as a linear combination
of the elements $e_i \wedge e_j$ for $i < j$. Moreover, these 2-vectors are all linearly
independent (check!), so they form a basis for the space of 2-vectors. There are
$\binom{n}{2}$ such vectors, where $\binom{n}{k} = \frac{n!}{k!\,(n-k)!}$ is the binomial coefficient “n choose k”, so
this is the dimension of $\bigwedge^2 V$. For example, if $V$ is four dimensional, the $\binom{4}{2} = 6$
vectors $e_1 \wedge e_2$, $e_1 \wedge e_3$, $e_1 \wedge e_4$, $e_2 \wedge e_3$, $e_2 \wedge e_4$, and $e_3 \wedge e_4$ form a basis for the
space $\bigwedge^2 V$.

However, $\binom{n}{2}$ is also the dimension of $\operatorname{Alt}^2 V$, because the components of an
alternating $(2, 0)$ tensor satisfy $T^{ij} = -T^{ji}$, which means that the only independent
components of $T$ are those for which $i < j$. The linear map $\psi : \operatorname{Alt}^2 V \to
\bigwedge^2 V$ given by $T \mapsto \sum_{i<j} T^{ij}\, e_i \wedge e_j$ furnishes the natural isomorphism.
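
The expansion of $v \wedge w$ on the basis $e_i \wedge e_j$ with $i < j$ is straightforward to compute. The sketch below assumes the convention (2.32) and a four-dimensional space; the function name `wedge` and the sample vectors are assumptions for this illustration, and the indices in the code are 0-based.

```python
from itertools import combinations
from math import comb

def wedge(v, w):
    """Coefficients v^i w^j - v^j w^i of v ∧ w on the basis e_i ∧ e_j, i < j."""
    n = len(v)
    return {(i, j): v[i] * w[j] - v[j] * w[i]
            for i, j in combinations(range(n), 2)}

v = [1, 2, 0, -1]
w = [3, 0, 1, 2]
vw = wedge(v, w)
wv = wedge(w, v)

print(vw)
print(len(vw), comb(4, 2))
```

The dictionary has one entry per basis 2-vector, so its length is the dimension of $\bigwedge^2 V$, and swapping the arguments flips the sign of every coefficient.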

Although we could now go on to define wedge products of more than two
vectors in terms of tensor products, it is much simpler and more elegant to
proceed axiomatically. Thus, let us begin again and define the wedge product
$v_1 \wedge v_2 \wedge \cdots \wedge v_p$ of $p$ vectors $v_1, v_2, \dots, v_p$ so that it satisfies the following
two axioms.

(1) (multilinearity) For every $i$ and all scalars $a$ and $b$,

$$v_1 \wedge \cdots \wedge \overbrace{(au + bv)}^{i\text{th position}} \wedge \cdots \wedge v_p = a\,(v_1 \wedge \cdots \wedge u \wedge \cdots \wedge v_p) + b\,(v_1 \wedge \cdots \wedge v \wedge \cdots \wedge v_p). \tag{2.33}$$

(2) (antisymmetry) For every $i, j$,

$$v_1 \wedge \cdots \wedge v_i \wedge \cdots \wedge v_j \wedge \cdots \wedge v_p = -v_1 \wedge \cdots \wedge v_j \wedge \cdots \wedge v_i \wedge \cdots \wedge v_p. \tag{2.34}$$

A wedge product of $p$ vectors is called a $p$-vector, and the vector space generated
by all the $p$-vectors is denoted $\bigwedge^p V$. Obviously $\bigwedge^1 V = V$, and by convention
$\bigwedge^0 V = \mathbb{F}$, where $\mathbb{F}$ is the underlying field. A basis for $\bigwedge^p V$ is given by all
$p$-vectors of the form

$$e_I := e_{i_1} \wedge \cdots \wedge e_{i_p}, \tag{2.35}$$

where the multi-index $I$ denotes the ordered index collection⁶ $(i_1, \dots, i_p)$ with

$$1 \le i_1 < \cdots < i_p \le \dim V. \tag{2.36}$$

We write $|I| = p$ to indicate that $I$ contains $p$ indices. The dimension of $\bigwedge^p V$ is
$\binom{n}{p}$, because there are this many basis vectors of the form (2.35). In ordinary and
multi-index notation, a general $p$-vector in $\bigwedge^p V$ is written as

$$\omega = \frac{1}{p!} \sum_{i_1, \dots, i_p} a^{i_1 \cdots i_p}\, e_{i_1} \wedge \cdots \wedge e_{i_p} = \sum_I a^I e_I, \tag{2.37}$$

where $a^{i_1 \cdots i_p}$ is totally antisymmetric and $a^I = a^{(i_1, \dots, i_p)} \in \mathbb{F}$.⁷ A single term in this
sum is called a monomial.

Example 2.2 In (2.37) we have stipulated that the components $a^{i_1 \cdots i_p}$ be totally
antisymmetric. Suppose for the moment that we had not insisted on this. Consider
what would happen in the case $p = 2$, $n = 3$. Writing out all the components
explicitly gives

$$\begin{aligned}
\tfrac{1}{2} \bigl( & a^{11}\, e_1 \wedge e_1 + a^{12}\, e_1 \wedge e_2 + a^{21}\, e_2 \wedge e_1 + a^{22}\, e_2 \wedge e_2 \\
& + a^{13}\, e_1 \wedge e_3 + a^{31}\, e_3 \wedge e_1 + a^{23}\, e_2 \wedge e_3 + a^{32}\, e_3 \wedge e_2 + a^{33}\, e_3 \wedge e_3 \bigr) \\
&= \tfrac{1}{2} (a^{12} - a^{21})\, e_1 \wedge e_2 + \tfrac{1}{2} (a^{13} - a^{31})\, e_1 \wedge e_3 + \tfrac{1}{2} (a^{23} - a^{32})\, e_2 \wedge e_3.
\end{aligned}$$

In other words, the coefficients of the basis vectors $e_i \wedge e_j$ with $i < j$ are the
antisymmetric parts of $a$. If $a$ had a symmetric part, it would disappear from this
expression. This is why we may as well demand that $a$ be antisymmetric from the
start. Moreover, when $a$ is antisymmetric, the above expression reduces to

$$a^{12}\, e_1 \wedge e_2 + a^{13}\, e_1 \wedge e_3 + a^{23}\, e_2 \wedge e_3,$$

which is precisely $\sum_I a^I e_I$ in this case.

EXERCISE 2.4 Generalize the previous example by showing that

$$\sum_{i_1, \dots, i_p} a^{i_1 \cdots i_p}\, e_{i_1} \wedge \cdots \wedge e_{i_p} = \sum_{i_1, \dots, i_p} a^{[i_1 \cdots i_p]}\, e_{i_1} \wedge \cdots \wedge e_{i_p}. \tag{2.38}$$

6 We adopt the standard convention that parentheses denote an ordered set whereas curly braces denote an
unordered set.

7 Warning: Be careful to distinguish $a^{(i_1 \cdots i_p)}$ (without the commas) from $a^{(i_1, \dots, i_p)}$ (with the commas). The
former is the totally symmetric part of $a$ as defined in (2.20), whereas the latter vanishes unless the indices are
in increasing order. For example, if $i_1 = 3$, $i_2 = 1$, and $i_3 = 2$, then
$a^{(312)} = \tfrac{1}{6} (a^{123} + a^{132} + a^{213} + a^{231} + a^{312} + a^{321})$,
whereas $a^{(3,1,2)} = 0$ (and $a^{(1,2,3)} = a^{123}$).

In other words, a general $p$-vector is determined uniquely by the antisymmetric components
of $a$. It follows that $\bigwedge^p V$ is isomorphic to $\operatorname{Alt}^p V$. Hint: Feel free to use the
fact easily proved from axiom (2.34) that, for any permutation $\sigma \in \mathfrak{S}_p$,

$$v_{i_{\sigma(1)}} \wedge v_{i_{\sigma(2)}} \wedge \cdots \wedge v_{i_{\sigma(p)}} = (-1)^{\sigma}\, v_{i_1} \wedge v_{i_2} \wedge \cdots \wedge v_{i_p}. \tag{2.39}$$

You will also need to use dummy indices cleverly.

Axioms (2.33) and (2.34) imply that

$$v_1 \wedge v_2 \wedge \cdots \wedge v_p = c_p \sum_{\sigma \in \mathfrak{S}_p} (-1)^{\sigma}\, v_{\sigma(1)} \otimes v_{\sigma(2)} \otimes \cdots \otimes v_{\sigma(p)}, \tag{2.40}$$

where $c_p$ is some constant. We have chosen $c_2 = 1$ in (2.32), so to be consistent we
should probably take $c_p = 1$; other authors take $c_p = 1/p!$. Fortunately, the choice
is mostly irrelevant. As long as we understand the wedge product via the axioms
(2.33) and (2.34), we need never go back and forth between wedge products and
tensor products.⁸ It is much easier to deal with wedge products than with alternating
sums such as those on the right-hand side of (2.40), and for the most part this
is what everyone does.


2.7 The exterior algebra 

The wedge product turns the collection $\bigwedge V$ of all $\bigwedge^p V$ for $p = 0, 1, 2, \dots$ into a
graded algebra, called the exterior algebra of $V$. Given $\lambda \in \bigwedge^p V$ and $\mu \in \bigwedge^q V$,
we define $\lambda \wedge \mu \in \bigwedge^{p+q} V$ in the obvious way, namely we set

$$(v_1 \wedge \cdots \wedge v_p) \wedge (w_1 \wedge \cdots \wedge w_q) = v_1 \wedge \cdots \wedge v_p \wedge w_1 \wedge \cdots \wedge w_q \tag{2.41}$$

and extend by linearity. What could be simpler? It follows that the exterior product
satisfies the usual properties of distributivity and associativity, together with what
is sometimes called graded antisymmetry:

$$\lambda \wedge (a\mu + b\nu) = a\,\lambda \wedge \mu + b\,\lambda \wedge \nu \quad \text{(distributivity)}, \tag{2.42}$$

$$\lambda \wedge (\mu \wedge \nu) = (\lambda \wedge \mu) \wedge \nu \quad \text{(associativity)}, \tag{2.43}$$

$$\mu \wedge \lambda = (-1)^{pq}\, \lambda \wedge \mu \quad \text{(graded antisymmetry)}. \tag{2.44}$$

EXERCISE 2.5 Prove property (2.44).

8 The one exception to this rule is when one views tensors as multilinear maps, for then those annoying prefactors
again rear their ugly heads. But here again axiomatics save the day for us, for we will define the natural
pairing of $p$-vectors and $p$-tuples of vectors via the interior product; this allows us to avoid any mention of
combinatorial prefactors. See Section 3.13.

EXERCISE 2.6 The only purpose of this very tedious exercise is to emphasize
that when dealing with alternating tensors it is far preferable to stick with the
axiomatically defined wedge product. Suppose that, in spite of our suggestion, you
insisted on defining $p$-vectors in terms of tensor products. You would then define a
linear map $\operatorname{alt} : T^p_0 \to \operatorname{Alt}^p V$ by

$$\operatorname{alt}(v_1 \otimes \cdots \otimes v_p) := \frac{1}{p!} \sum_{\sigma \in \mathfrak{S}_p} (-1)^{\sigma}\, v_{\sigma(1)} \otimes \cdots \otimes v_{\sigma(p)}, \tag{2.45}$$

extended by linearity. You might go on to define the wedge product of $S \in T^p_0$ and
$T \in T^q_0$ by

$$S \wedge T = \operatorname{alt}(S \otimes T). \tag{2.46}$$

(Note that this differs from our previous convention by a numerical factor.) Use
(2.46) to prove that the wedge product so defined is associative. Remark: The wedge
product defined this way also satisfies properties (2.42) and (2.44). Property (2.42)
is immediate from the multilinearity of alt, while (2.44) follows from the antisymmetry
of the wedge product for vectors, as before. Hint: Proceed in steps. Let $S$ and
$T$ be tensors. (i) Show that $\operatorname{alt}(\operatorname{alt}(T)) = \operatorname{alt}(T)$. (ii) Show that if $\operatorname{alt}(S) = 0$ then
$S \wedge T = T \wedge S = 0$. (iii) Use the linearity of alt and properties (i) and (ii) to show
that $(R \wedge S) \wedge T - \operatorname{alt}(R \otimes S \otimes T) = 0$, where, say, $R \in T^r_0$. Argue that a similar
result holds for $R \wedge (S \wedge T)$. If you’re really stuck, look in [33] or [79].


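2.8 The induced linear transformation $\bigwedge T$

Let $T : V \to V$ be a linear map. We define the $p$th exterior power of $T$ to be the
linear map

$$\textstyle\bigwedge^p T : \bigwedge^p V \to \bigwedge^p V \tag{2.47}$$

given by

$$(\textstyle\bigwedge^p T)(v_1 \wedge \cdots \wedge v_p) = T v_1 \wedge \cdots \wedge T v_p. \tag{2.48}$$

This map is natural, in the sense that

\begin{align}
\textstyle\bigwedge^p (ST)(v_1 \wedge \cdots \wedge v_p) &= (ST) v_1 \wedge \cdots \wedge (ST) v_p \notag \\
&= (\textstyle\bigwedge^p S)(T v_1 \wedge \cdots \wedge T v_p) \tag{2.49} \\
&= (\textstyle\bigwedge^p S)(\textstyle\bigwedge^p T)(v_1 \wedge \cdots \wedge v_p), \tag{2.50}
\end{align}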


so that

$$\textstyle\bigwedge^p (ST) = (\bigwedge^p S)(\bigwedge^p T). \tag{2.51}$$

Similarly, if $\lambda \in \bigwedge^p V$ and $\mu \in \bigwedge^q V$ then

$$(\textstyle\bigwedge^{p+q} T)(\lambda \wedge \mu) = (\bigwedge^p T)(\lambda) \wedge (\bigwedge^q T)(\mu). \tag{2.52}$$

Of particular interest is the special case in which $p = n = \dim V$. In that case,
because $\dim \bigwedge^n V = 1$ the map $\bigwedge^n T$ is a linear map between one-dimensional
vector spaces and hence is multiplication by a scalar. If $v_1, \dots, v_n$ are linearly
independent vectors then

$$(\textstyle\bigwedge^n T)(v_1 \wedge \cdots \wedge v_n) = T v_1 \wedge \cdots \wedge T v_n = (\det T)(v_1 \wedge \cdots \wedge v_n). \tag{2.53}$$

The scalar $\det T$ is called the determinant of the map $T$, because if $\mathbf{T}$ represents
$T$ in some basis then $\det T$ is the usual matrix determinant $\det \mathbf{T}$.

EXERCISE 2.7 Let $T : \mathbb{R}^3 \to \mathbb{R}^3$ be given by $T e_1 = e_1 + 2e_2$, $T e_2 = 3e_2 + 2e_3$, and
$T e_3 = e_1 + e_3$, where $\{e_1, e_2, e_3\}$ is the standard basis. Compute $\bigwedge^3 T (e_1 \wedge e_2 \wedge e_3)$
directly from (2.48) and the properties of the wedge product and compare it to $\det \mathbf{T}$.

EXERCISE 2.8 Prove that $\det T = \det \mathbf{T}$ in any basis, where $\det \mathbf{T}$ is defined by
(1.46). (Actually, you need to use (1.46) together with (1.56).)

Pick a basis $\{e_i\}$ of $V$, and recall the multi-index notation $e_I$ introduced in (2.35).
Then we have⁹

$$\begin{aligned}
(\textstyle\bigwedge^p T)\, e_I &= T e_{i_1} \wedge \cdots \wedge T e_{i_p} \\
&= \sum_{j_1, \dots, j_p} (T_{j_1 i_1} \cdots T_{j_p i_p})\, e_{j_1} \wedge \cdots \wedge e_{j_p} \\
&= \sum_J e_J\, T_{JI},
\end{aligned}$$

which says that the map $\bigwedge^p T$ is represented by the matrix $(T_{JI})$ in the natural
basis. This matrix is called the $p$th compound matrix of $(T_{ij})$ and is the matrix of
minors¹⁰ of $(T_{ij})$ indexed by rows $J$ and columns $I$.

EXERCISE 2.9 Let $V$ be a three-dimensional vector space with basis $\{e_1, e_2, e_3\}$,
and choose the basis $\{e_1 \wedge e_2, e_1 \wedge e_3, e_2 \wedge e_3\}$ for $\bigwedge^2 V$. Show by explicit computation
that the second compound matrix of $(T_{ij})$ is given by

$$\begin{pmatrix}
T_{11} T_{22} - T_{21} T_{12} & T_{11} T_{23} - T_{21} T_{13} & T_{12} T_{23} - T_{22} T_{13} \\
T_{11} T_{32} - T_{31} T_{12} & T_{11} T_{33} - T_{31} T_{13} & T_{12} T_{33} - T_{32} T_{13} \\
T_{21} T_{32} - T_{31} T_{22} & T_{21} T_{33} - T_{31} T_{23} & T_{22} T_{33} - T_{32} T_{23}
\end{pmatrix}.$$


9 As discussed previously, this is one of those cases where the upstairs/downstairs placement of indices is
entirely irrelevant and would just clutter up the formulae.

10 The minor of a matrix $(T_{ij})$ indexed by rows $J$ and columns $I$ is the determinant of the matrix obtained from
$(T_{ij})$ by crossing out all rows except those indexed by $J$ and all columns except those indexed by $I$.

By (2.51), the $p$th compound of the product of two matrices is the product of
the $p$th compounds of each matrix, a useful fact which is very difficult to prove
directly. In particular, when $p = n$ this is just the statement that the determinant of
a product of two matrices is the product of their determinants:

$$\det(ST) = (\det S)(\det T). \tag{2.54}$$
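
Although the multiplicativity of compound matrices is awkward to prove directly, it is easy to test on examples. The sketch below builds the $p$th compound matrix as the matrix of $p \times p$ minors, with index sets listed in increasing (lexicographic) order, and checks the product rule for two sample $3 \times 3$ integer matrices. The function names and the sample matrices are assumptions made for this illustration, and the determinant is computed by a brute-force sum over permutations, which is adequate only for small matrices.

```python
from itertools import combinations, permutations

def sign(perm):
    """Sign of a permutation of 0..n-1, found by counting inversions."""
    inversions = sum(1 for a in range(len(perm))
                     for b in range(a + 1, len(perm)) if perm[a] > perm[b])
    return -1 if inversions % 2 else 1

def det(m):
    """Determinant as a signed sum over permutations."""
    n = len(m)
    total = 0
    for perm in permutations(range(n)):
        term = sign(perm)
        for row in range(n):
            term *= m[row][perm[row]]
        total += term
    return total

def compound(m, p):
    """Matrix of p x p minors of m, with rows J and columns I in increasing order."""
    index_sets = list(combinations(range(len(m)), p))
    return [[det([[m[r][c] for c in cols] for r in rows]) for cols in index_sets]
            for rows in index_sets]

def mat_mul(a, b):
    """Product of two square matrices of the same size."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

S = [[1, 2, 0], [0, 1, 3], [2, 0, 1]]
T = [[2, 1, 0], [1, 0, 1], [0, 3, 1]]

second_compound_ok = compound(mat_mul(S, T), 2) == mat_mul(compound(S, 2), compound(T, 2))
top_compound_ok = compound(S, 3) == [[det(S)]]
determinant_ok = det(mat_mul(S, T)) == det(S) * det(T)
print(second_compound_ok, top_compound_ok, determinant_ok)
```

The case $p = n$ reduces the compound matrix to a single entry, the determinant, which is how (2.54) appears as a special case.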

EXERCISE 2.10 Using (2.53) and the result of Exercise 2.8, prove the following 
statements for any matrix. (1) The determinant changes sign whenever any two 
rows or columns are interchanged. (2) The determinant vanishes if any two rows 
or columns are equal. (3) The determinant is unchanged if we add a multiple of any 
row to another row or a multiple of any column to another column. (4) Multiplying a 
column or row by a constant multiplies the determinant by the same constant. Hint: 
Let $\{e_i\}$ be a basis for $V$, and consider a linear map $T : V \to V$ satisfying $T e_i = v_i$,
so that $v_i$ is the $i$th column of the matrix $\mathbf{T}$ that represents $T$ in the basis $\{e_i\}$. You
may want to invoke (1.56).


2.9 The Hodge dual 

An inner product $g$ on $V$ induces a natural inner product (also denoted $g$) on $\bigwedge^p V$,
as follows. If $\lambda = v_1 \wedge \cdots \wedge v_p$ and $\mu = w_1 \wedge \cdots \wedge w_p$, we define

$$g(\lambda, \mu) := \det\big(g(v_i, w_j)\big), \tag{2.55}$$

where the right-hand side is the determinant of the matrix whose $(i, j)$th entry is
$g(v_i, w_j)$, and then extend by bilinearity. By (1.56) $g$ is symmetric, so we need
only verify its nondegeneracy. For this, let $\{e_i\}$ be an orthonormal basis for $V$.
Then the claim is that the set $\{e_I = e_{i_1} \wedge \cdots \wedge e_{i_p}\}$, as $I$ varies over all ordered
index sets of size $p$, is an orthonormal basis for $\bigwedge^p V$ relative to $g$. Set $e_J =
e_{j_1} \wedge \cdots \wedge e_{j_p}$. Define $(g_{IJ})_{k\ell} := g(e_{i_k}, e_{j_\ell})$. By orthonormality, $(g_{IJ})_{k\ell} = \pm\delta_{i_k j_\ell}$,
so we have

$$g(e_I, e_J) = \det\big((g_{IJ})_{k\ell}\big) = \det\big(\pm\delta_{i_k j_\ell}\big).$$

Suppose $j_p \notin I$, say, so that $I \neq J$. Then the rightmost determinant vanishes,
because the $p$th column of $(\delta_{i_k j_\ell})$ vanishes. Similar reasoning shows that
$g(e_I, e_J) = 0$ whenever $I \neq J$. However, if $I = J$ then $g(e_I, e_J) = \pm 1$.
Hence the claim is proved. But if an orthonormal basis exists, the metric must
be nondegenerate by Lemma 1.6.

Example 2.3 Let $\{e_1, e_2, \dots, e_n\}$ be a basis for $V$, and define $g_{ij} = g(e_i, e_j)$.
Choose $\sigma = e_1 \wedge \cdots \wedge e_n$. Then $g(\sigma, \sigma) = \det(g_{ij})$. If the basis is orthonormal then
$g(\sigma, \sigma) = (-1)^d$, where $n - 2d$ is the signature of $g$.



In preparation for the main definition of this section, fix $\lambda \in \bigwedge^p V$. Then we
have a natural linear map from $\bigwedge^{n-p} V$ to $\bigwedge^n V$ given by

$$\mu \mapsto \lambda \wedge \mu. \tag{2.56}$$

But $\bigwedge^n V$ is a one-dimensional vector space spanned by some element $\sigma$, so

$$\lambda \wedge \mu = f_\lambda(\mu)\,\sigma \tag{2.57}$$

for some linear functional $f_\lambda$ on $\bigwedge^{n-p} V$. Given an inner product $g$, the Riesz
lemma guarantees the existence of a unique $(n - p)$-vector $\star\lambda$ such that

$$g(\star\lambda, \mu) = f_\lambda(\mu). \tag{2.58}$$

The element $\star\lambda \in \bigwedge^{n-p} V$ is called the Hodge dual or Hodge star of $\lambda$.
Combining (2.57) and (2.58) we may write

$$\lambda \wedge \mu = g(\star\lambda, \mu)\,\sigma. \tag{2.59}$$

Written this way it is clear that the Hodge dual depends on both the inner product
$g$ and the choice of basis element $\sigma$ for $\bigwedge^n V$.

At first sight the definition of the Hodge dual appears to be worthless, because it 
asserts the existence of an object but seems to give you no means of computing it. 
It is true that the definition is indirect but, as we shall soon see, one can actually use 
(2.59) to compute things. Of course, you might be wondering what it’s good for. 
The answer is basically that Hodge duality provides an elegant means of writing 
various important formulae (see e.g. Example 3.14). 

Arguably the most important properties of the Hodge dual are given in the 
following theorem. 

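Theorem 2.1 Let $V$ be $n$-dimensional with inner product $g$. Let $\eta, \lambda \in \bigwedge^p V$,
and choose $\sigma \in \bigwedge^n V$ to satisfy $g(\sigma, \sigma) = (-1)^d$. Then

$$\star\star\lambda = (-1)^{p(n-p)+d}\,\lambda, \tag{2.60}$$

$$\eta \wedge \star\lambda = \lambda \wedge \star\eta = (-1)^d\, g(\eta, \lambda)\,\sigma. \tag{2.61}$$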

Proof Choose an orthonormal basis $\{e_i\}$ for $V$. We must have

$$\sigma = a\, e_1 \wedge \cdots \wedge e_n$$

for some constant $a$, and the hypothesis requires that $a = \pm 1$. For definiteness we
choose $a = 1$. (The proof for $a = -1$ is identical.)
We prove (2.60) and leave the proof of (2.61) as an exercise. By linearity it
suffices to verify (2.60) for $\lambda = e_I$ with $|I| = p$. Without loss of generality we
may assume that $I = (1, 2, \dots, p)$, so that

$$\lambda = e_1 \wedge e_2 \wedge \cdots \wedge e_p.$$

If $J$ is any multi-index other than $I$, (2.59) gives

$$\lambda \wedge e_J = g(\star\lambda, e_J)\,\sigma. \tag{2.62}$$

The left-hand side vanishes unless $J = (p+1, p+2, \dots, n)$, so we must have

$$\star\lambda = c\, e_{p+1} \wedge \cdots \wedge e_n$$

for some constant $c$ that can only be $\pm 1$. The constant is determined by choosing
$J = (p+1, p+2, \dots, n)$, so that (2.62) becomes

$$\sigma = c\, g(e_J, e_J)\,\sigma.$$

It follows that

$$c = g(e_J, e_J),$$

so we may write

$$\star e_I = g(e_J, e_J)\, e_J. \tag{2.63}$$

A similar argument shows that

$$\star e_J = b\, g(e_I, e_I)\, e_I$$

for some constant $b$, determined as follows. We have

$$(-1)^{p(n-p)}\,\sigma = (-1)^{p(n-p)}\, e_I \wedge e_J = e_J \wedge e_I = g(\star e_J, e_I)\,\sigma = g(e_I, \star e_J)\,\sigma,$$

so that

$$(-1)^{p(n-p)} = g(e_I, \star e_J) = b\,[g(e_I, e_I)]^2 = b$$

and thus

$$\star e_J = (-1)^{p(n-p)}\, g(e_I, e_I)\, e_I. \tag{2.64}$$

We conclude that

$$
\begin{aligned}
\star\star e_I &= (-1)^{p(n-p)}\, g(e_J, e_J)\, g(e_I, e_I)\, e_I \\
&= (-1)^{p(n-p)}\, g(\sigma, \sigma)\, e_I \\
&= (-1)^{p(n-p)+d}\, e_I,
\end{aligned}
$$

where the equality used in the penultimate step,

$$g(e_I, e_I)\, g(e_J, e_J) = g(\sigma, \sigma), \tag{2.65}$$

follows from (2.55). □


EXERCISE 2.11 Prove (2.61).


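Example 2.4 Let $V = \mathbb{M}^4$ and let $e_0 = (1, 0, 0, 0)$, $e_1 = (0, 1, 0, 0)$, $e_2 =
(0, 0, 1, 0)$, and $e_3 = (0, 0, 0, 1)$. Thus

$$
(g_{ij}) = \begin{pmatrix} -1 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 \\ 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 1 \end{pmatrix}. \tag{2.66}
$$

Choose $\sigma = e_0 \wedge e_1 \wedge e_2 \wedge e_3$, so that $g(\sigma, \sigma) = \det(g_{ij}) = -1$. Let us compute $\star e_0$,
for which $p = 1$. As $n = 4$, $\star e_0$ will be a 3-vector, so it can be written as

$$\star e_0 = \sum a^{i_1 i_2 i_3}\, e_{i_1} \wedge e_{i_2} \wedge e_{i_3}$$

for some constants $a^{i_1 i_2 i_3}$. Equation (2.61) gives

$$
\begin{aligned}
e_0 \wedge \star e_0 &= -g(e_0, e_0)\,\sigma = \sigma, \\
e_1 \wedge \star e_0 &= -g(e_1, e_0)\,\sigma = 0, \\
e_2 \wedge \star e_0 &= -g(e_2, e_0)\,\sigma = 0, \\
e_3 \wedge \star e_0 &= -g(e_3, e_0)\,\sigma = 0,
\end{aligned}
$$

from which we conclude that

$$\star e_0 = e_1 \wedge e_2 \wedge e_3. \tag{2.67}$$

Similar calculations yield

$$\star e_1 = e_0 \wedge e_2 \wedge e_3, \tag{2.68}$$

$$\star e_2 = e_0 \wedge e_3 \wedge e_1, \tag{2.69}$$

$$\star e_3 = e_0 \wedge e_1 \wedge e_2. \tag{2.70}$$

Equation (2.60) gives

$$\star(e_1 \wedge e_2 \wedge e_3) = \star\star e_0 = (-1)^{1(4-1)+1}\, e_0 = e_0. \tag{2.71}$$

Similarly, we get

$$\star(e_0 \wedge e_2 \wedge e_3) = e_1, \tag{2.72}$$

$$\star(e_0 \wedge e_3 \wedge e_1) = e_2, \tag{2.73}$$

$$\star(e_0 \wedge e_1 \wedge e_2) = e_3. \tag{2.74}$$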

Next, let’s find $\star(e_0 \wedge e_1)$, which is a 2-vector. Equation (2.61) gives

$$(e_0 \wedge e_1) \wedge \star(e_0 \wedge e_1) = -g(e_0 \wedge e_1, e_0 \wedge e_1)\,\sigma = \sigma,$$

because

$$
g(e_0 \wedge e_1, e_0 \wedge e_1) =
\begin{vmatrix} g(e_0, e_0) & g(e_0, e_1) \\ g(e_1, e_0) & g(e_1, e_1) \end{vmatrix}
= \begin{vmatrix} -1 & 0 \\ 0 & 1 \end{vmatrix} = -1.
$$

When $(i, j) \notin \{(0, 1), (1, 0)\}$ we have

$$(e_i \wedge e_j) \wedge \star(e_0 \wedge e_1) = -g(e_i \wedge e_j, e_0 \wedge e_1)\,\sigma = 0.$$

Therefore

$$\star(e_0 \wedge e_1) = e_2 \wedge e_3. \tag{2.75}$$

By analogy we have

$$\star(e_0 \wedge e_2) = e_3 \wedge e_1, \tag{2.76}$$

$$\star(e_0 \wedge e_3) = e_1 \wedge e_2. \tag{2.77}$$

But the analogy stops here, because $e_0$ is different from the other basis vectors. For
example,

$$\star(e_1 \wedge e_2) = -e_0 \wedge e_3, \tag{2.78}$$

$$\star(e_2 \wedge e_3) = -e_0 \wedge e_1, \tag{2.79}$$

$$\star(e_3 \wedge e_1) = -e_0 \wedge e_2. \tag{2.80}$$

These equations can be obtained in two ways. For example, to obtain (2.78) we can
proceed as before to get

$$(e_1 \wedge e_2) \wedge \star(e_1 \wedge e_2) = -g(e_1 \wedge e_2, e_1 \wedge e_2)\,\sigma = -\sigma,$$

because

$$
g(e_1 \wedge e_2, e_1 \wedge e_2) =
\begin{vmatrix} g(e_1, e_1) & g(e_1, e_2) \\ g(e_2, e_1) & g(e_2, e_2) \end{vmatrix}
= \begin{vmatrix} 1 & 0 \\ 0 & 1 \end{vmatrix} = 1.
$$

Similarly, we compute

$$(e_i \wedge e_j) \wedge \star(e_1 \wedge e_2) = -g(e_i \wedge e_j, e_1 \wedge e_2)\,\sigma = 0,$$

whenever $(i, j) \notin \{(1, 2), (2, 1)\}$, from which we conclude that we must have (2.78).
Alternatively, we could simply notice from (2.77) that

$$\star(e_1 \wedge e_2) = \star^2(e_0 \wedge e_3),$$

and then use the fact that, on 2-vectors,

$$\star^2 = (-1)^{p(n-p)+d} = (-1)^{2(4-2)+1} = -1.$$

Lastly, observe that

$$\sigma \wedge \star\sigma = -g(\sigma, \sigma)\,\sigma = \sigma,$$

giving

$$\star\sigma = 1 \quad \text{and} \quad \star 1 = \sigma, \tag{2.81}$$

a result valid more generally for any signature inner product on $\mathbb{R}^n$.
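
The basis computations of Example 2.4 for 1-, 2-, and 3-vectors can be mechanized. The sketch below is an illustration under stated assumptions, not the book's procedure: it works only with an orthonormal basis, represents $e_I$ by an increasing tuple of indices, and fixes the coefficient in $\star e_I = c\, e_J$ (with $J$ the increasing complement of $I$) by imposing (2.61) with $\eta = \lambda = e_I$. The names `ETA`, `perm_sign`, `g_basis`, and `star` are hypothetical.

```python
ETA = [-1, 1, 1, 1]                 # g(e_i, e_i) for the orthonormal basis e_0, ..., e_3
N = len(ETA)
D = sum(1 for s in ETA if s < 0)    # number of minus signs: g(sigma, sigma) = (-1)**D

def perm_sign(seq):
    """Sign of the permutation that sorts seq, found by counting inversions."""
    inversions = sum(1 for a in range(len(seq))
                     for b in range(a + 1, len(seq)) if seq[a] > seq[b])
    return -1 if inversions % 2 else 1

def g_basis(I):
    """g(e_I, e_I): by (2.55) this is the product of the diagonal entries g(e_i, e_i)."""
    value = 1
    for i in I:
        value *= ETA[i]
    return value

def star(I):
    """Return (c, J) such that *e_I = c e_J, for an increasing index tuple I."""
    I = tuple(I)
    J = tuple(k for k in range(N) if k not in I)
    # e_I ^ e_J = perm_sign(I + J) * sigma, while (2.61) requires
    # e_I ^ *e_I = (-1)**D * g(e_I, e_I) * sigma; solve for c.
    c = (-1) ** D * g_basis(I) * perm_sign(I + J)
    return c, J

# A few entries to compare with the example:
star_e0 = star((0,))        # (2.67): *e_0 = e_1 ^ e_2 ^ e_3
star_e01 = star((0, 1))     # (2.75): *(e_0 ^ e_1) = e_2 ^ e_3
star_e12 = star((1, 2))     # (2.78): *(e_1 ^ e_2) = -e_0 ^ e_3
```

Because the output always uses the increasing complement $J$, a result such as (2.69) appears as $\star e_2 = -e_0 \wedge e_1 \wedge e_3$, which equals $e_0 \wedge e_3 \wedge e_1$ after reordering.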


Additional exercises

2.12 Linear independence and tensor products Let $\{v_1, \dots, v_k\}$ be a set of
linearly independent vectors in $V$. Show that, for any vectors $w_1, \dots, w_k$,

$$\sum_{i=1}^{k} v_i \otimes w_i = 0 \implies w_1 = \cdots = w_k = 0.$$

Hint: Complete $\{v_1, \dots, v_k\}$ to a basis.

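2.13 The Kronecker product of two matrices Let $(A_{ij})$ be an $m \times m$ matrix and
let $(B_{ij})$ be an $n \times n$ matrix. The Kronecker product $A \otimes B$ is the $mn \times mn$
block matrix

$$
A \otimes B = \begin{pmatrix} A_{11} B & \cdots & A_{1m} B \\ \vdots & \ddots & \vdots \\ A_{m1} B & \cdots & A_{mm} B \end{pmatrix}.
$$

More explicitly, $A \otimes B$ is

$$
\begin{pmatrix}
A_{11} B_{11} & A_{11} B_{12} & \cdots & A_{11} B_{1n} & \cdots & A_{1m} B_{11} & A_{1m} B_{12} & \cdots & A_{1m} B_{1n} \\
A_{11} B_{21} & A_{11} B_{22} & \cdots & A_{11} B_{2n} & \cdots & A_{1m} B_{21} & A_{1m} B_{22} & \cdots & A_{1m} B_{2n} \\
\vdots & \vdots & & \vdots & & \vdots & \vdots & & \vdots \\
A_{11} B_{n1} & A_{11} B_{n2} & \cdots & A_{11} B_{nn} & \cdots & A_{1m} B_{n1} & A_{1m} B_{n2} & \cdots & A_{1m} B_{nn} \\
\vdots & \vdots & & \vdots & & \vdots & \vdots & & \vdots \\
A_{m1} B_{11} & A_{m1} B_{12} & \cdots & A_{m1} B_{1n} & \cdots & A_{mm} B_{11} & A_{mm} B_{12} & \cdots & A_{mm} B_{1n} \\
A_{m1} B_{21} & A_{m1} B_{22} & \cdots & A_{m1} B_{2n} & \cdots & A_{mm} B_{21} & A_{mm} B_{22} & \cdots & A_{mm} B_{2n} \\
\vdots & \vdots & & \vdots & & \vdots & \vdots & & \vdots \\
A_{m1} B_{n1} & A_{m1} B_{n2} & \cdots & A_{m1} B_{nn} & \cdots & A_{mm} B_{n1} & A_{mm} B_{n2} & \cdots & A_{mm} B_{nn}
\end{pmatrix}.
$$

Now let $A : V \to V$ and $B : W \to W$ be linear maps and let $v \in V$ and
$w \in W$. The tensor product map $A \otimes B : V \otimes W \to V \otimes W$ is naturally
defined by

$$(A \otimes B)(v \otimes w) = Av \otimes Bw,$$

extended by linearity. Pick a basis $\{e_1, \dots, e_m\}$ for $V$ and $\{f_1, \dots, f_n\}$ for
$W$. If $A$ is represented by the matrix $\mathbf{A}$ and $B$ is represented by the matrix $\mathbf{B}$
relative to the chosen basis, show that $A \otimes B$ is represented by the Kronecker
product matrix $\mathbf{A} \otimes \mathbf{B}$ relative to the basis $\{e_i \otimes f_j\}$, $1 \le i \le m$, $1 \le j \le n$,
of $V \otimes W$, provided we order the basis elements of $V \otimes W$ in lexicographic
order, namely as $e_1 \otimes f_1, \dots, e_1 \otimes f_n, e_2 \otimes f_1, \dots, e_m \otimes f_n$.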
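
A small numerical check can make the lexicographic ordering in Exercise 2.13 concrete. This is a minimal sketch, assuming matrices stored as lists of rows and vectors as lists of components; the function names `kron`, `kron_vec`, and `matvec`, and the sample data, are made up for illustration.

```python
def kron(A, B):
    """Kronecker product of square matrices; rows and columns are ordered
    lexicographically by the index pairs (i, j) and (k, l)."""
    m, n = len(A), len(B)
    return [[A[i][k] * B[j][l] for k in range(m) for l in range(n)]
            for i in range(m) for j in range(n)]

def kron_vec(v, w):
    """Components of v (x) w in the basis e_i (x) f_j, in lexicographic order."""
    return [vi * wj for vi in v for wj in w]

def matvec(M, x):
    """Matrix times column vector."""
    return [sum(a * b for a, b in zip(row, x)) for row in M]

# Hypothetical sample data (m = 2, n = 3).
A = [[1, 2], [3, 4]]
B = [[0, 1, 0], [2, 0, 1], [1, 1, 1]]
v = [1, -1]
w = [2, 0, 3]

lhs = matvec(kron(A, B), kron_vec(v, w))        # (A (x) B)(v (x) w)
rhs = kron_vec(matvec(A, v), matvec(B, w))      # Av (x) Bw
```

The two lists `lhs` and `rhs` agree entry by entry, which is the defining property $(A \otimes B)(v \otimes w) = Av \otimes Bw$ written in components.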
2.14 Linear independence and wedge products Show that $v_1 \wedge \cdots \wedge v_p = 0$ if
and only if the set $\{v_1, v_2, \dots, v_p\}$ is linearly dependent.

2.15 Cartan’s lemma Let $\{v_1, \dots, v_k\}$ be a set of linearly independent vectors
in $V$, and suppose that

$$v_1 \wedge w_1 + v_2 \wedge w_2 + \cdots + v_k \wedge w_k = 0$$

for some vectors $w_1, \dots, w_k$. Prove that

$$w_i = \sum_{j=1}^{k} A_{ij}\, v_j$$

for some symmetric matrix $A$ (i.e., $A_{ij} = A_{ji}$). Hint: Complete $v_1, \dots, v_k$ to
a basis of $V$.

2.16 The Sylvester-Franke theorem Let $V$ be an $n$-dimensional vector space
and $A : V \to V$ a linear map. Show that

$$\det \bigwedge^p A = (\det A)^{\binom{n-1}{p-1}}.$$

Hint: Assume $A$ to be diagonalizable. The general case then follows by a
continuity argument.

2.17 The Hodge dual of a vector Let $V$ be Euclidean $n$-space, namely $\mathbb{R}^n$
equipped with the standard basis $e_1, \dots, e_n$ and the Euclidean inner product
$g(e_i, e_j) = \delta_{ij}$. Choose $\sigma = e_1 \wedge \cdots \wedge e_n$. Let $\mu = \sum_i a_i e_i$. Compute
$\star\mu$.

2.18 The Hodge dual as pseudo-isometry Show that the Hodge star operator
preserves the natural inner product on $p$-forms up to sign. That is, for any
$p$-forms $\lambda$ and $\mu$,

$$g(\star\lambda, \star\mu) = (-1)^d\, g(\lambda, \mu),$$

where $d = (n - t)/2$.

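2.19 The wedge product of a direct sum of vector spaces Given two vector
spaces $V$ and $W$, define a linear map

$$\bigwedge^k V \otimes \bigwedge^l W \to \bigwedge^{k+l}(V \oplus W)$$

by

$$(v_1 \wedge \cdots \wedge v_k) \otimes (w_1 \wedge \cdots \wedge w_l) \mapsto (v_1, 0) \wedge \cdots \wedge (v_k, 0) \wedge (0, w_1) \wedge \cdots \wedge (0, w_l),$$

where for clarity we represent $v_i$ as an element of $V \oplus W$ by $(v_i, 0)$, etc. Use
this map to show that

$$\bigoplus_{k=0}^{n} \Big( \bigwedge^k V \otimes \bigwedge^{n-k} W \Big) = \bigwedge^n (V \oplus W). \tag{2.82}$$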
2.20 The characteristic polynomial and the exterior algebra Let $V$ be an
$n$-dimensional vector space and $A : V \to V$ a linear map. Then

$$\det(I + zA) = \sum_{k=0}^{n} \big(\operatorname{tr} \bigwedge^k A\big)\, z^k, \tag{2.83}$$

where $I$ is the identity matrix and tr means “trace”. We will show this in two
ways.

(a) First assume $A$ to be diagonalizable. Prove (2.83) by relating the left-hand
side to the characteristic polynomial of $A$. Recall that

$$\prod_{k=1}^{n} (1 + a_k z) = \sum_{k=0}^{n} e_k(a_1, \dots, a_n)\, z^k,$$

where $e_k(a_1, \dots, a_n)$ is the $k$th elementary symmetric function (with $e_0 = 1$).

(b) Now prove (2.83) without assuming diagonalizability, by letting $B :=
I + zA$ and using the distributive property of the wedge product and
the definition of the trace. For simplicity (as the general proof works
similarly) you may wish to restrict to the case $n = 3$.
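
Before attempting the proofs, the identities in Exercises 2.16 and 2.20 can be sanity-checked on a concrete matrix. The sketch below is a numerical spot check, not a proof: it assumes that $\bigwedge^k A$ is represented by the $k$th compound matrix (the matrix of $k \times k$ minors), and the helper names and the sample matrix `A` are hypothetical.

```python
from itertools import combinations
from math import comb

def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if not m:
        return 1  # empty matrix, so the 0th compound is the 1 x 1 matrix (1)
    total = 0
    for c, entry in enumerate(m[0]):
        minor = [row[:c] + row[c + 1:] for row in m[1:]]
        total += (-1) ** c * entry * det(minor)
    return total

def compound(m, p):
    """Matrix of p x p minors of m, rows and columns in lexicographic order."""
    idx = list(combinations(range(len(m)), p))
    return [[det([[m[j][i] for i in I] for j in J]) for I in idx] for J in idx]

def trace(m):
    return sum(m[i][i] for i in range(len(m)))

def sylvester_franke_holds(A):
    """Check det(compound_p(A)) == det(A)**C(n-1, p-1) for p = 1, ..., n."""
    n = len(A)
    return all(det(compound(A, p)) == det(A) ** comb(n - 1, p - 1)
               for p in range(1, n + 1))

def char_poly_identity_holds(A, z_values):
    """Check det(I + zA) == sum_k tr(compound_k(A)) z**k at the given integers z."""
    n = len(A)
    traces = [trace(compound(A, k)) for k in range(n + 1)]
    for z in z_values:
        lhs = det([[(1 if i == j else 0) + z * A[i][j] for j in range(n)]
                   for i in range(n)])
        if lhs != sum(t * z ** k for k, t in enumerate(traces)):
            return False
    return True

# Hypothetical integer test matrix; exact arithmetic avoids rounding issues.
A = [[2, 1, 0, 3], [0, 1, 4, 1], [1, 0, 2, 0], [3, 1, 1, 1]]
```

Both sides of (2.83) are polynomials of degree at most $n$ in $z$, so agreement at more than $n$ distinct values of $z$ confirms the identity for that particular matrix.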

2.21 The symmetric algebra Given $p$ vectors $v_1, \dots, v_p \in V$, their symmetric
tensor product

$$v_1 \odot v_2 \odot \cdots \odot v_p$$

is defined by the following two properties.

(1) (multilinearity) For every $i$ and all scalars $a$ and $b$, with $au + bv$ in the $i$th position,

$$v_1 \odot \cdots \odot (au + bv) \odot \cdots \odot v_p = a\,(v_1 \odot \cdots \odot u \odot \cdots \odot v_p) + b\,(v_1 \odot \cdots \odot v \odot \cdots \odot v_p).$$

(2) (symmetry) For every $i$, $j$,

$$v_1 \odot \cdots \odot v_i \odot \cdots \odot v_j \odot \cdots \odot v_p = v_1 \odot \cdots \odot v_j \odot \cdots \odot v_i \odot \cdots \odot v_p.$$

The symmetric tensor product of $p$ vectors is clearly a symmetric tensor, and
the vector space of all such tensors is commonly denoted $\operatorname{Sym}^p V$. Obviously
$\operatorname{Sym}^1 V = V$ and by convention $\operatorname{Sym}^0 V = \mathbb{F}$. If $\{e_i\}$ is a basis of $V$, a basis
for $\operatorname{Sym}^p V$ is given by all elements of the form

$$e_{i_1} \odot \cdots \odot e_{i_p} \qquad (1 \le i_1 \le \cdots \le i_p \le \dim V),$$

so the most general element of $\operatorname{Sym}^p V$ can be written

$$\sum_{i_1, \dots, i_p} a^{i_1 \cdots i_p}\, e_{i_1} \odot \cdots \odot e_{i_p},$$

where $a^{i_1 \cdots i_p}$ are the components of a symmetric $p$-tensor.

(a) Let $\dim V = n$. Show that the dimension of $\operatorname{Sym}^p V$ is $\binom{n+p-1}{p}$.

(b) The symmetric tensor product turns the collection $\operatorname{Sym} V$ of all $\operatorname{Sym}^p V$
for $p = 0, 1, 2, \dots$ into a graded algebra, called the symmetric algebra
of $V$, in the obvious way:

$$(v_1 \odot \cdots \odot v_p) \odot (w_1 \odot \cdots \odot w_q) := v_1 \odot \cdots \odot v_p \odot w_1 \odot \cdots \odot w_q.$$

Show that $\operatorname{Sym} V$ is isomorphic to the polynomial algebra
$\mathbb{F}[e_1, e_2, \dots, e_n]$, namely, the set of all polynomials with the $e_i$ acting
as indeterminates.

(c) Show that

$$\operatorname{Sym}^p (V \oplus W) = \bigoplus_{j=0}^{p} \operatorname{Sym}^j V \otimes \operatorname{Sym}^{p-j} W.$$


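(d) A linear map $A : V \to V$ induces a linear map $\operatorname{Sym}^k A : \operatorname{Sym}^k V \to
\operatorname{Sym}^k V$ in the natural way, namely

$$\operatorname{Sym}^k A\,(v_1 \odot \cdots \odot v_k) = Av_1 \odot \cdots \odot Av_k.$$

Assuming $A$ to be diagonalizable, verify that

$$\frac{1}{\det(I - zA)} = \sum_{k=0}^{\infty} \big(\operatorname{tr} \operatorname{Sym}^k A\big)\, z^k.$$

Hint: Look up “homogeneous symmetric functions.”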

2.22 Fock space In quantum mechanics the space of possible states of a single
particle is a special kind of (often infinite-dimensional) vector space, called
a Hilbert space, denoted here by $\mathcal{H}$. A state containing $p$ particles is an
element of $\mathcal{H}^{\otimes p}$, the tensor product of $\mathcal{H}$ with itself $p$ times. But in relativistic
quantum mechanics (or quantum field theory) the number of particles is not
necessarily constant, so the state of the system is now an element of the direct
sum

$$\mathcal{F} := \bigoplus_{p=0}^{\infty} \mathcal{H}^{\otimes p},$$

called Fock space. The symmetric and antisymmetric subspaces

$$\mathcal{F}_+ := \bigoplus_{p=0}^{\infty} \operatorname{Sym}^p \mathcal{H}, \qquad \mathcal{F}_- := \bigoplus_{p=0}^{\infty} \bigwedge^p \mathcal{H},$$

are of particular importance as they represent the spaces of bosonic and
fermionic particle states, respectively.

Given a vector $v \in \mathcal{H}$, the bosonic creation and annihilation operators $a^\dagger(v)$
and $a(v)$ and fermionic creation and annihilation operators $b^\dagger(v)$ and $b(v)$
act linearly on their respective Fock spaces as follows:

$$
\begin{aligned}
a^\dagger(v)(v_1 \odot \cdots \odot v_p) &= v \odot v_1 \odot \cdots \odot v_p, \\
a(v)(v_1 \odot \cdots \odot v_p) &= \sum_{i=1}^{p} (v, v_i)\, v_1 \odot \cdots \odot \widehat{v_i} \odot \cdots \odot v_p, \\
b^\dagger(v)(v_1 \wedge \cdots \wedge v_p) &= v \wedge v_1 \wedge \cdots \wedge v_p, \\
b(v)(v_1 \wedge \cdots \wedge v_p) &= \sum_{i=1}^{p} (-1)^{i-1} (v, v_i)\, v_1 \wedge \cdots \wedge \widehat{v_i} \wedge \cdots \wedge v_p,
\end{aligned}
$$

where $(u, v)$ is the standard Hermitian inner product on $\mathcal{H}$ (see Appendix B),
and the caret on $v_i$ means that the vector is missing.

Given two operators $A$ and $B$, their commutator (respectively, anticommutator)
is $[A, B]_- := AB - BA$ (respectively, $[A, B]_+ := AB + BA$). Show
that the creation and annihilation operators obey the canonical commutation
relations

$$
\begin{aligned}
[a(u), a(v)]_- &= 0, \\
[a^\dagger(u), a^\dagger(v)]_- &= 0, \\
[a(u), a^\dagger(v)]_- &= (u, v)\,1
\end{aligned}
$$

on $\mathcal{F}_+$ and the canonical anticommutation relations

$$
\begin{aligned}
[b(u), b(v)]_+ &= 0, \\
[b^\dagger(u), b^\dagger(v)]_+ &= 0, \\
[b(u), b^\dagger(v)]_+ &= (u, v)\,1
\end{aligned}
$$

on $\mathcal{F}_-$, where $1$ is the identity operator. Hint: To show that an operator is
zero on $\mathcal{F}_+$ (respectively, $\mathcal{F}_-$) it suffices to show that it is zero on every basis
element and, to do this, it is enough to show that the operator vanishes on a
basis element of the form $v_1 \odot \cdots \odot v_p$ (respectively, $v_1 \wedge \cdots \wedge v_p$).


3 

Differentiation on manifolds 


When in doubt, differentiate. 

Shing-Shen Chern (1979). 

Raoul Bott (1982).

The idea of a differentiable manifold had its genesis in the nineteenth century 
with the work of Carl Friedrich Gauss and of Georg Friedrich Bernhard Riemann. 
Gauss was interested in surveying and cartography, which led him to develop the 
tools of calculus on curved surfaces. His famous theorema egregium, or remarkable
theorem,$^{1}$ revealed that one could consider the intrinsic properties of a surface
independently of the way in which it was embedded in three-dimensional space, 
and this led him, Riemann, and others, to abstract these concepts even further. 
Their ideas have had far reaching applications in many areas of mathematics and 
the natural sciences. 

Roughly, an $n$-dimensional manifold (or $n$-manifold) can be thought of as a kind
of patchwork quilt built from pieces of $\mathbb{R}^n$. Classic examples of 2-manifolds are
the 2-sphere $S^2$ and the 2-torus $T^2$ (see Figure 3.1). Usually one pictures these as
living in $\mathbb{R}^3$, but one can consider them in their own right just as bits of $\mathbb{R}^2$ sewn
together in certain ways. The technical definition of a manifold requires considerable
background, which we will try to keep to a minimum. First, we need the idea
of a topology.


3.1 Basic topology* 

Consider a basketball. When it is inflated, its surface is a sphere.$^{2}$ But when it is
deflated its surface is still a topological sphere. In fact, we could deform the sphere
in any way we like and, as long as we do not tear it anywhere, it is still topologically
a sphere. We say that all these shapes have the same topology but, since
the distance between the points on the surface has changed, they have different
geometries (see Figure 3.2).

1 For the younger generation, theorema egregium might be better translated as “egregious theorem”.

2 Ignore the dimples and the hole for the needle.

Figure 3.1 The 2-sphere and the 2-torus.

Figure 3.2 Topological 2-spheres.

At first sight the actual definition of a topology appears to have nothing to do
with these notions. Only after much study does one begin to see why the following
definition is reasonable.$^{4}$ A topology $\tau$ on a set $X$ is a family of subsets of $X$, called
open sets, satisfying the following.

(1) Arbitrary unions of open sets are open.

(2) Finite intersections of open sets are open.

(3) The empty set $\emptyset$ and $X$ are both open.

A topological space (or, simply, a space) is a set $X$ endowed with a topology.

Example 3.1 Let $X$ be a finite set, and let $\tau$ be the set of all subsets of $X$. This is
called the discrete topology on $X$.

A neighborhood of $p \in X$ is any open set containing $p$. If $q$ lies in a neighborhood
of $p$ we say that $q$ is near $p$. Topology is therefore sometimes called the
study of nearness relations. A topology on $X$ is called Hausdorff if the points
in every pair lie in disjoint neighborhoods or, more technically, if for every two
distinct points $p, q \in X$, there exist two disjoint open sets $U$ and $V$ such that $p \in U$
and $q \in V$. We will primarily be interested in Hausdorff topologies, because these
coincide with our intuition that points are isolated objects; other sorts of topologies
are generally considered to be pathological (at least by non-topologists).

3 In this book, the phrase “2-sphere” will always mean the standard unit 2-sphere, namely $\{(x, y, z) \in \mathbb{R}^3 :
x^2 + y^2 + z^2 = 1\}$.

4 Readers may wish to skip the rest of this section, and only refer back to it as needed.
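
For a finite set the axioms of a topology and the Hausdorff condition can be checked by brute force. The following sketch is an illustration only: the function names `powerset`, `is_topology`, and `is_hausdorff` are hypothetical, and the check of the union and intersection axioms relies on the fact that for a finite family it is enough to test pairs (arbitrary finite unions and intersections then follow by induction).

```python
from itertools import combinations

def powerset(X):
    """All subsets of X, i.e. the discrete topology of Example 3.1."""
    items = list(X)
    return [set(c) for r in range(len(items) + 1) for c in combinations(items, r)]

def is_topology(X, tau):
    """Check the three axioms for a finite family tau of subsets of a finite set X."""
    X = frozenset(X)
    tau = {frozenset(U) for U in tau}
    if any(not U <= X for U in tau):
        return False                      # every open set must be a subset of X
    if frozenset() not in tau or X not in tau:
        return False                      # axiom (3)
    # Axioms (1) and (2): pairwise closure suffices for a finite family.
    return all((U | V) in tau and (U & V) in tau for U in tau for V in tau)

def is_hausdorff(X, tau):
    """Every pair of distinct points has disjoint open neighborhoods."""
    tau = [frozenset(U) for U in tau]
    return all(any(p in U and q in V and not (U & V) for U in tau for V in tau)
               for p in X for q in X if p != q)

X = {1, 2, 3}
discrete = powerset(X)
indiscrete = [set(), X]                   # only the empty set and X are open
not_a_topology = [set(), {1}, {2}, X]     # the union {1, 2} is missing
```

On this three-point set the discrete topology passes both tests, while the family containing only $\emptyset$ and $X$ is a topology that fails to be Hausdorff.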

EXERCISE 3.1 Let $Y$ be a subset of a topological space $X$. Suppose that for every
$y \in Y$ there is an open set $U$ with $y \in U \subseteq Y$. Show that $Y$ is open in $X$.

A subset $Y \subseteq X$ is closed if its complement $X - Y$ is open.

EXERCISE 3.2 Let $X$ be a topological space. Show that the following properties
hold.

(1) Arbitrary intersections of closed sets are closed.

(2) Finite unions of closed sets are closed.

(3) The empty set $\emptyset$ and $X$ are both closed.

(This exercise demonstrates that we could equally well take these axioms to be the 
definition of a topology, then define open sets to be the complements of closed sets.) 

The interior $Y^\circ$ of a set $Y$ is the union of all the open sets contained in $Y$, and the
closure $\operatorname{cl} Y$ of $Y$ is the intersection of all the closed sets containing $Y$. A subset
$A \subseteq X$ is dense in $X$ if $\operatorname{cl} A = X$. A point $x \in X$ is an accumulation point of $Y$
if $x \in \operatorname{cl}(Y - \{x\})$.

EXERCISE 3.3 Let $Y$ be a subset of a topological space $X$. Show that $y \in \operatorname{cl} Y$ if
and only if every neighborhood of $y$ meets $Y$.

EXERCISE 3.4 Show that a set Y is closed if and only if it contains all its 
accumulation points. 

EXERCISE 3.5 Show that a single point is a closed set in a Hausdorff topology. 

An open cover of $X$ is a collection $\{U_\alpha\}$ of open sets whose union is $X$. An open
cover is locally finite if, for every $p \in X$, there is an open neighborhood $U$ of $p$
such that $|\{\alpha : U \cap U_\alpha \neq \emptyset\}|$ is finite. (Note that a locally finite open cover is not
necessarily finite.) An open cover $\{V_\beta\}$ is a refinement of $\{U_\alpha\}$ if for all $\beta$ there is
an $\alpha$ such that $U_\alpha \supseteq V_\beta$.

A topological space $X$ is compact if every open cover has a finite subcollection
that also covers $X$ (“every open cover has a finite subcover”). Intuitively, compact
spaces can be thought of as being finite in extent. For example, a subset
of Euclidean space (see Example 3.4) is compact if and only if it is closed and
bounded (the Heine-Borel theorem).

EXERCISE 3.6 Show that a closed subset of a compact set is compact. 

A basis for a topology on $X$ is a collection $\mathcal{B}$ of subsets of $X$ such that

(1) every $x \in X$ belongs to some element of $\mathcal{B}$;

(2) given any two basis elements $B_1$ and $B_2$ in $\mathcal{B}$, if $x \in B_1 \cap B_2$ then there exists
a $B_3 \in \mathcal{B}$ such that $x \in B_3 \subseteq B_1 \cap B_2$.

The topology generated by $\mathcal{B}$ is just the collection of open subsets of $X$, where $U$
is declared open if for every $x \in U$ there is an element $B \in \mathcal{B}$ such that $x \in B \subseteq U$.

Example 3.2 The standard topology on $\mathbb{R}$ is the topology generated by all the
intervals $(a, b) := \{x : a < x < b\}$.

Given two spaces $X$ and $Y$, the product topology on $X \times Y$ is the topology generated
by all sets of the form $U \times V$, where $U$ is open in $X$ and $V$ is open in $Y$.

Example 3.3 The standard topology on $\mathbb{R}^n$ is just the product topology on
$\mathbb{R} \times \cdots \times \mathbb{R}$ ($n$ times).

Example 3.4 Recall that Euclidean $n$-space $\mathbb{E}^n$ is $\mathbb{R}^n$ equipped with the Euclidean
inner product $g$. The Euclidean distance $d(x, y)$ between two points $x$ and $y$ is
defined to be $d(x, y) = \|x - y\|$.$^{6}$ The open ball of radius $r$ about the point $p$ is
the set $B_r(p) = \{x \in \mathbb{R}^n : d(x, p) < r\}$. The set of all these open balls generates
the Euclidean topology on $\mathbb{E}^n$. It can be shown that the Euclidean topology on $\mathbb{E}^n$
coincides with the standard topology on $\mathbb{R}^n$. For this reason, most people refer to $\mathbb{R}^n$
as Euclidean space! Unfortunately, this convention is so ingrained in many people
(including this author) that it would be difficult to ignore. Henceforth, whenever $\mathbb{R}^n$
is treated as a topological space it is understood that we are using the (standard)
Euclidean topology.

Let $f : X \to Y$ be a map between topological spaces. The map $f$ is continuous
if the inverse image of every open set in $Y$ is open in $X$.

EXERCISE 3.7 Show that $f$ is continuous if the inverse image of every closed set is
closed.

EXERCISE 3.8 In calculus you learn that $f : \mathbb{R} \to \mathbb{R}$ is continuous at a point $p \in \mathbb{R}$
if for every $\varepsilon > 0$ there exists a $\delta > 0$ such that, for all $x \in \mathbb{R}$, $|f(x) - f(p)| <
\varepsilon$ whenever $|x - p| < \delta$. The map $f$ is said to be continuous if it is continuous
at every point $p \in \mathbb{R}$. Show that the epsilon-delta definition of continuity and the
inverse-image definition of continuity coincide for maps from $\mathbb{R}$ to $\mathbb{R}$.

6 We use the standard notation $\|x\| := [g(x, x)]^{1/2}$, where the positive root is understood.

EXERCISE 3.9 The topological space $Y$ is a continuous image of $X$ if there is a
continuous surjection $f : X \to Y$. Show that the continuous image of a compact set
is compact.

EXERCISE 3.10 Let $\tau$ be a topology on $X$, and let $Y \subseteq X$. The collection

$$\tau_Y = \{Y \cap U : U \in \tau\} \tag{3.1}$$

is a topology on $Y$, called the induced topology or subspace topology on $Y$. The
space $Y$ equipped with this topology is a subspace of $X$. Show that $\tau_Y$ is the coarsest
topology for which the inclusion map $\iota : Y \hookrightarrow X$ is continuous. (A topology $\sigma$ is
coarser than a topology $\tau$ if $\sigma \subseteq \tau$; $\tau$ is finer than $\sigma$ because it has more open sets.)

EXERCISE 3.11 Show that a subspace of a Hausdorff space is Hausdorff. 

EXERCISE 3.12 Let $f : X \to Y$ be a continuous map of Hausdorff topological
spaces, and suppose that $X$ is compact. Show that $f^{-1}(y)$ is compact for $y \in Y$.

EXERCISE 3.13 If $X$ is a topological space (with topology $\tau$) and $\sim$ is an equivalence
relation on $X$, the quotient $Y = X/\sim$ is naturally a topological space under
the quotient topology $\sigma$, defined as

$$\sigma = \{U \subseteq Y : \pi^{-1}(U) \in \tau\},$$

where $\pi : X \to Y$ is the natural projection map $x \mapsto [x]$. In other words, the
quotient topology is the finest topology for which $\pi$ is continuous. (Note that the
quotient topology $\sigma$ does not generally preserve the properties of $\tau$. For example,
$\sigma$ need not be Hausdorff even if $\tau$ is.) Suppose that $g : X \to Z$ is a continuous
map of topological spaces that respects the equivalence relation so that $x_1 \sim x_2$
implies $g(x_1) = g(x_2)$. Show there is a unique continuous map $f : Y \to Z$ such
that $g = f \circ \pi$. (One says that $g$ “descends to the quotient”.)

A map $f : X \to Y$ between topological spaces is a homeomorphism if it is
continuous with continuous inverse, meaning that there is a continuous map $g :
Y \to X$ such that $g \circ f = f \circ g = 1$. If such a pair of maps exist, we write
$X \sim Y$ and say that $X$ and $Y$ are homeomorphic (or topologically equivalent). A
property $P(X)$ is a topological invariant of $X$ if $X \sim Y$ implies $P(X) = P(Y)$.
That is, $P(X)$ depends only on the topology of $X$. The property $P(X)$ is a complete
topological invariant of a space $X$ provided that $P(X) = P(Y)$ if and only if
$X \sim Y$. Topology can be loosely characterized as the study of the topological
invariants of spaces.


3.2 Multivariable calculus facts 

Let $U \subseteq \mathbb{R}^n$ be an open set and suppose that $f : U \to \mathbb{R}$ is a function. Label the
points of $\mathbb{R}^n$ by the $n$-tuples $x = (x^1, \dots, x^n)$. Then the partial derivative $\partial f/\partial x^i$
is defined by

$$\frac{\partial f}{\partial x^i} := \lim_{t \to 0} \frac{f(x + t e_i) - f(x)}{t}, \tag{3.2}$$

where $e_i = (0, \dots, 1, \dots, 0)$ has a “1” in the $i$th slot. It is sometimes convenient
to use multi-index notation to denote higher-order partial derivatives. Let
$\alpha = (i_1, \dots, i_k)$. Then

$$\frac{\partial^k f}{\partial x^\alpha}(x) := \frac{\partial^k f}{\partial x^{i_1} \cdots \partial x^{i_k}}. \tag{3.3}$$

A function $f : \mathbb{R}^n \to \mathbb{R}$ is $C^\infty$, or smooth, if $\partial^k f/\partial x^\alpha$ exists and is continuous for
all $\alpha$. The composition of smooth functions is smooth.

Again let $U \subseteq \mathbb{R}^n$ be an open set but now suppose that $f : U \to \mathbb{R}^m$ is a map,
given by $x \mapsto (f^1(x), \dots, f^m(x))$. The map $f$ is smooth if each component
function $f^i$ is smooth. The derivative $Df(x)$ of $f$ at $x$ is just the matrix of partial
derivatives $Df(x) = (\partial f^i/\partial x^j)$;$^{7}$ this matrix is called the Jacobian matrix. When
$n = m$, its determinant $\det(\partial f^i/\partial x^j)$ is called the Jacobian determinant or more
simply the Jacobian of the map $f$.


EXERCISE 3.14 Show that if $f : U \subseteq \mathbb{R}^n \to \mathbb{R}^m$ is linear then $Df = f$. (“The
derivative of a linear map equals the map itself.”)


Now let $U, V \subseteq \mathbb{R}^n$ be two open sets, and let $f : U \to V$ be a homeomorphism.
If $f$ and $f^{-1}$ are both smooth then $f$ is called a diffeomorphism. This brings
us to the all-important inverse function theorem, which says that if the matrix
representing the derivative of a function is invertible at some point then the function
itself is a local diffeomorphism in the neighborhood of that point.$^{8}$


Theorem 3.1 Let $W \subseteq \mathbb{R}^n$ and suppose that $f : W \to \mathbb{R}^n$ is a smooth map. If
$a \in W$ and $Df(a)$ is nonsingular then there exists an open neighborhood $U$ of $a$
in $W$ such that $V = f(U)$ is open and $f : U \to V$ is a diffeomorphism. If $x \in U$
and $y = f(x)$ then $(Df^{-1})(y) = (Df(x))^{-1}$.

7 Technically, the derivative $Df(x)$ is the unique linear map $T : \mathbb{R}^n \to \mathbb{R}^m$ satisfying

$$\lim_{h \to 0} \frac{\|f(x + h) - f(x) - T(h)\|}{\|h\|} = 0,$$

where $h \in \mathbb{R}^n$ and $\|x\|$ is the usual Euclidean length of $x$. The Jacobian matrix is just the representation of this
linear map relative to the standard bases of $\mathbb{R}^n$ and $\mathbb{R}^m$. You can prove this for yourself or else look at [79],
Chapter 2. The distinction is not really critical in Euclidean space, where the standard basis is almost always
understood, but it will be more important later when we discuss general manifolds. By that time, though, we
will have a fancier way of understanding the derivative map.

8 For a proof, see e.g. [11], pp. 42ff or [79], pp. 35ff.
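
The last statement of the theorem, that the derivative of the inverse is the inverse of the derivative, can be illustrated numerically. The sketch below is only an illustration under assumptions: it uses the map from polar to Cartesian coordinates as a sample diffeomorphism away from the origin, approximates Jacobian matrices by central differences, and all function names and the step size `h` are hypothetical choices.

```python
import math

def polar_to_cartesian(p):
    """Sample smooth map f(r, theta) = (r cos theta, r sin theta)."""
    r, theta = p
    return (r * math.cos(theta), r * math.sin(theta))

def cartesian_to_polar(q):
    """Local inverse of the sample map, valid away from the origin and the branch cut."""
    x, y = q
    return (math.hypot(x, y), math.atan2(y, x))

def jacobian(f, x, h=1e-6):
    """Central-difference approximation of Df(x); entry [i][j] approximates d f^i / d x^j."""
    x = list(x)
    columns = []
    for j in range(len(x)):
        x_plus, x_minus = x[:], x[:]
        x_plus[j] += h
        x_minus[j] -= h
        f_plus, f_minus = f(x_plus), f(x_minus)
        columns.append([(a - b) / (2 * h) for a, b in zip(f_plus, f_minus)])
    # columns[j][i] holds d f^i / d x^j, so transpose to index rows by i.
    return [list(row) for row in zip(*columns)]

def matmul(a, b):
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]

x0 = (2.0, 0.7)                                   # a point (r, theta) with r > 0
y0 = polar_to_cartesian(x0)
Df = jacobian(polar_to_cartesian, x0)             # Df(x)
Df_inv = jacobian(cartesian_to_polar, y0)         # D(f^{-1})(y)
product = matmul(Df_inv, Df)                      # should be close to the identity
jacobian_det = Df[0][0] * Df[1][1] - Df[0][1] * Df[1][0]   # close to r = 2
```

Up to finite-difference error, `product` is the identity matrix, and `jacobian_det` is close to $r$, so the derivative is nonsingular exactly when $r \neq 0$.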

The importance of the inverse function theorem for us lies in what it tells us about 
coordinate systems, as we shall see in the next section. 

EXERCISE 3.15 Prove the converse of the inverse function theorem: if $f : U \to V$
is a diffeomorphism of open sets then $Df(x)$ is invertible at all points $x \in U$. Hint:
Consider $f \circ f^{-1}$ and use the chain rule.


3.3 Coordinates 
3.3.1 Coordinate functions 

In the usual presentation of $\mathbb{R}^n$ we write the points as $n$-tuples $(x^1, \dots, x^n)$. The
individual numbers $x^i$ are called the coordinates of the point. We are so accustomed
to this presentation that we do not usually think too much about it. But there
is a hidden conceptual confusion because the presentation conflates two different
ideas, namely the points and their coordinates. The difficulty is resolved by thinking
of the coordinates as coordinate functions that assign the coordinates to the
point. According to this point of view, $x^i$ is a function from $\mathbb{R}^n$ to $\mathbb{R}$ that sends the
point $a = (a_1, \dots, a_n)$ to the number $a_i$. Equivalently, we write

$$x^i(a) = a_i. \tag{3.4}$$

This seems perfectly fine. The problem comes when we want to express a point of
$\mathbb{R}^n$ as $x = (x^1, \dots, x^n)$, for then we have the somewhat strange looking equation

$$x^i(x) = x^i. \tag{3.5}$$

In this equation, the $x^i$ on the left is a function from $\mathbb{R}^n$ to $\mathbb{R}$, while the $x^i$ on the
right is a simple number, namely the $i$th coordinate of the point $x = (x^1, \dots, x^n)$.
Unfortunately, there is no way to avoid this notational crisis, because bad habits
have become so ingrained. You must decide from the context which meaning of $x^i$
is intended. Henceforth we will make every effort to avoid writing equations such
as (3.5).

Part of the reason why this problem is so pernicious is that we generally describe
points in terms of their coordinates. For example, the canonical presentation of the
unit 2-sphere $S^2$ defines it to be the subset of $\mathbb{R}^3$ given by $\{x \in \mathbb{R}^3 : (x^1)^2 +
(x^2)^2 + (x^3)^2 = 1\}$. How else would you define it? Almost every way you can
think of uses coordinates. Even if it were possible to avoid coordinates in order to
describe a sphere (and it is), it is essentially impossible to describe any moderately
complicated set of points without using coordinates. After all, this is the reason why
Descartes first introduced coordinates and why analytic geometry is so powerful.
Indeed, we will often refer to a point using its coordinates in some coordinate
system. Nevertheless, we must remember in our own minds that coordinates are
just arbitrary labels for the points.

3.3.2 Coordinate systems 

This gives rise to a related problem, one which leads eventually to the idea of a
manifold. The description of the 2-sphere given above uses too many coordinates.
The 2-sphere $S^2$ is two dimensional, and our intuition says that a two-dimensional
object ought to be describable in terms of two coordinates, not three. But it turns
out that there is no single coordinate system of two coordinates that will do the job.
Instead, we must use at least two different coordinate systems (each comprising
two coordinates) to label the points of $S^2$.

To get a feel for the difficulty, let’s try to describe $S^2$ using only a single pair of
coordinates. The obvious choice is to use polar coordinates $(\theta, \phi)$, so that points
on the sphere are given by

$$x^1 = \sin\theta\cos\phi, \qquad x^2 = \sin\theta\sin\phi, \qquad x^3 = \cos\theta.$$

The problem lies in the range of the coordinates. For example, if we were to choose
$0 \le \theta < 2\pi$ and $0 \le \phi < 2\pi$, some points would end up getting assigned more
than one pair of coordinates, such as $(\pi/2, 0)$ and $(3\pi/2, \pi)$, both of which label
the point $(1, 0, 0)$. Not only would it be confusing to have two distinct labels for the
same object, it also makes taking derivatives impossible because a function must
be single valued in order to be continuous and hence differentiable. To ameliorate
this problem one usually takes the range to be $0 \le \theta \le \pi$ and $0 \le \phi < 2\pi$.
But even this is no good, because the north and south poles are each labeled by an
infinity of pairs, namely $(0, \phi)$ and $(\pi, \phi)$, respectively. To avoid the problem, one
can take $0 < \theta < \pi$ and $0 \le \phi < 2\pi$, but then the poles are not labeled at all.
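The failure of uniqueness described above can be checked numerically. The following is a minimal sketch in Python (the helper names `sphere_point` and `close` are ours, introduced only for illustration); it evaluates the polar parameterization at the two labels mentioned in the text and at several labels of the north pole.

```python
import math

def sphere_point(theta, phi):
    """Map polar angles (theta, phi) to the point (x1, x2, x3) on the unit sphere."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))

def close(p, q, tol=1e-12):
    """Compare two points of R^3 up to floating-point rounding."""
    return all(abs(a - b) < tol for a, b in zip(p, q))

# Two distinct labels for the same point (1, 0, 0) when 0 <= theta < 2*pi.
a = sphere_point(math.pi / 2, 0.0)
b = sphere_point(3 * math.pi / 2, math.pi)
same_point = close(a, b) and close(a, (1.0, 0.0, 0.0))

# With theta = 0, every value of phi labels the north pole (0, 0, 1).
pole_labels = [sphere_point(0.0, phi) for phi in (0.0, 1.0, 2.0, 3.0)]
all_north = all(close(p, (0.0, 0.0, 1.0)) for p in pole_labels)
```

Both flags come out true, which is just the statement that the labeling is many-to-one on these ranges.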

In fact, no matter how hard you try, you will never find a single set of coordinates 
to describe the 2-sphere. The reason is that the sphere and a part of the plane are 
topologically distinct. But, as we shall see, we can label all the points of $S^2$ with
two sets of coordinates because we can cut the sphere into two pieces, each of 
which can be flattened to look like a piece of the plane. 

3.3.3 Change of coordinates 

Consider the transformation in $\mathbb{R}^2$ between Cartesian and polar coordinates:

$$x = r\cos\theta, \qquad y = r\sin\theta. \tag{3.6}$$
Every point in the plane can be described uniquely in the $(x, y)$ coordinate system,
but the origin is a problem for the polar coordinate system because it is described
by the infinity of pairs $(r, \theta) = (0, \text{anything})$. Another way to see that something
strange happens at the origin is to compute the Jacobian of the transformation. 
Doing so gives 


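$$\begin{vmatrix} \partial x/\partial r & \partial x/\partial\theta \\ \partial y/\partial r & \partial y/\partial\theta \end{vmatrix} = \begin{vmatrix} \cos\theta & -r\sin\theta \\ \sin\theta & r\cos\theta \end{vmatrix} = r, \tag{3.7}$$

which vanishes at the origin.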
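As a quick sanity check on the Jacobian of the polar transformation, one can estimate the determinant by finite differences. This is only an illustrative sketch: the function names and the step size `h` are our own choices, not part of the text.

```python
import math

def polar_to_cartesian(r, theta):
    """The map (r, theta) -> (x, y) of equation (3.6)."""
    return (r * math.cos(theta), r * math.sin(theta))

def jacobian_det(r, theta, h=1e-6):
    """Central-difference estimate of det d(x, y)/d(r, theta)."""
    x_rp, y_rp = polar_to_cartesian(r + h, theta)
    x_rm, y_rm = polar_to_cartesian(r - h, theta)
    x_tp, y_tp = polar_to_cartesian(r, theta + h)
    x_tm, y_tm = polar_to_cartesian(r, theta - h)
    dx_dr, dy_dr = (x_rp - x_rm) / (2 * h), (y_rp - y_rm) / (2 * h)
    dx_dt, dy_dt = (x_tp - x_tm) / (2 * h), (y_tp - y_tm) / (2 * h)
    return dx_dr * dy_dt - dx_dt * dy_dr

# The estimate should track r, and in particular vanish at r = 0.
dets = {r: jacobian_det(r, 0.7) for r in (0.0, 0.5, 2.0)}
```

The estimates agree with $r$ up to discretization error, so the determinant is zero exactly where the polar labels stop being unique.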

The inverse function theorem provides the link between these two ways of 
determining the validity of a given coordinate transformation. A coordinate trans- 
formation is a good one if there is a one-to-one correspondence between the 
two sets of coordinates and if the transformation is differentiable. In other
words, a set of functions $\{f^i(x)\}$ on $\mathbb{R}^n$ constitutes a good coordinate system
in the neighborhood of a point $x$ if the transformation
$(x^1, \dots, x^n) \mapsto (f^1(x^1, \dots, x^n), \dots, f^n(x^1, \dots, x^n))$ is a diffeomorphism.$^{9}$ Ascertaining whether
a map is a diffeomorphism can be difficult. But checking whether a Jacobian
vanishes is usually easy. The inverse function theorem assures us that, as long
as the Jacobian of the transformation is nonsingular, we can coordinatize the
neighborhood of $x$ with the functions $f^i$.


3.4 Differentiable manifolds 

This leads us at last to the definition of a smooth manifold. An n-dimensional
smooth manifold $M$ consists of a Hausdorff topological space together with a
countable collection of open sets $\{U_i\}$, called coordinate neighborhoods or coordinate
patches, that cover $M$ and a collection of maps $\{\varphi_i\}$, called coordinate
maps, satisfying two conditions.

(1) Each $\varphi_i : U_i \to \mathbb{R}^n$ is a homeomorphism onto an open subset of $\mathbb{R}^n$. (We say
that $M$ is locally Euclidean.)

(2) If $U_i$ and $U_j$ are two overlapping coordinate neighborhoods with coordinate
maps $\varphi_i$ and $\varphi_j$ then $\varphi_j \circ \varphi_i^{-1} : \varphi_i(U_i \cap U_j) \to \varphi_j(U_i \cap U_j)$ is
a diffeomorphism.$^{10}$ (We say that the coordinate maps are compatible on
overlaps.)


9 To be fair, if we didn't want to differentiate anything we would only need it to be a homeomorphism. But we 
do want to differentiate things. 

10 If we replace the word “diffeomorphism” with the word “homeomorphism” we simply get a manifold, 
sometimes called a topological manifold. 
Figure 3.3 Two coordinate charts and a transition function.

Each pair $(U_i, \varphi_i)$ is called a coordinate chart, and the collection of all coordinate
charts is called an atlas. The maps $\varphi_j \circ \varphi_i^{-1}$ are called transition functions of the
atlas. See Figure 3.3.

The condition that M be Hausdorff is there basically to express our intuition of 
space as “infinitely divisible”, so that we can separate points with open sets. The 
condition that the cover be countable is there for a technical reason having to do 
with extending locally defined quantities to globally defined ones. The condition 
that $M$ be locally Euclidean serves at least two purposes. First, it tells us that, in the
neighborhood of a point, all n-dimensional manifolds look like a (mildly deformed)
bit of Euclidean n-space. Second, it allows us to define local coordinates so that
we can compute things. The compatibility condition ensures that we can patch 
together the coordinate systems consistently, so that we always end up with valid 
coordinates. 

Let $(U, \varphi)$ be a coordinate chart with $p \in U$ and suppose that $\varphi(p) = q$. If
$x^1, \dots, x^n$ are the standard coordinate functions on $\mathbb{R}^n$ then $q$ has coordinates
$(x^1(q), \dots, x^n(q))$. Thus we can write

$$\varphi(p) = (x^1(q), \dots, x^n(q)). \tag{3.8}$$

It is a convenient abuse of notation to view the $x^i$ as coordinate functions on $U$
instead of $\varphi(U)$ and to write

$$\varphi(p) = (x^1(p), \dots, x^n(p)). \tag{3.9}$$

As $U$ generally does not live in Euclidean space, (3.9) makes no sense unless it is
interpreted to mean (3.8). (Of course, if $U$ is a subset of Euclidean space then $\varphi$ is
just the identity map, so (3.9) makes perfect sense; this is the justification for the
convention.) The functions $x^1, \dots, x^n$, viewed as functions on $U$, are called local
coordinates on $U$.

Let $V$ be another coordinate neighborhood, with coordinate map $\psi$ and local coordinates $y^1, \dots, y^n$. If
$U \cap V \neq \emptyset$ then on $\varphi(U \cap V)$ the action of the transition function $\psi \circ \varphi^{-1}$ can be
written in local coordinates as follows:

$$(x^1, \dots, x^n) \mapsto (y^1(x^1, \dots, x^n), \dots, y^n(x^1, \dots, x^n)). \tag{3.10}$$

The compatibility condition (2) in the definition of a manifold (see the start of this
section) is just the statement that (3.10) is a diffeomorphism, which, by the inverse
function theorem, is equivalent to the requirement that the Jacobian determinant
$\det(\partial y^i/\partial x^j)$ be nonzero.

The sign of this determinant is important. A manifold is said to be orientable 
if it is possible to choose an ordering of the local coordinates so that the Jacobian 
determinants of the transition functions have the same sign on every pair of overlapping
neighborhoods.$^{11}$ If this is possible then the manifold has two opposite
orientations, according to the choice of sign. For the most part we will concern 
ourselves only with orientable manifolds, because they are the ones that arise most 
often in applications, although we will have occasion to consider non-orientable 
manifolds as well. As orientable manifolds enjoy many properties not shared by 
nonorientable manifolds, we are often obliged to distinguish the two possibilities 
explicitly. 

Example 3.5 The Cartesian space $\mathbb{R}^n$ is an orientable n-dimensional manifold.
(Surprise.)

Example 3.6 The unit 2-sphere $S^2$ is an orientable two-dimensional manifold. To
see this, we must cover it with a set of compatible coordinate charts, then check
the signs of the Jacobians.$^{12}$ A traditional way to accomplish this is to cover the
sphere with six patches, but here we employ stereographic projection in order to
use only two. The first chart, $(U, \varphi)$, is defined as follows. The open set $U$ in $\mathbb{R}^3$
covers the entire sphere except for the north pole $N$. Pick a point $P \in U$, and let
$\varphi(P)$ be the unique point $Q$ on the $z = 0$ plane such that the line $NQ$ contains $P$.


11 In Appendix C we discuss an easier way to verify the orientability of smooth manifolds using differential
forms. Orientations of two-dimensional surfaces are the easiest to understand, especially if they are viewed
as subsets of $\mathbb{R}^3$. In that case, one can show that a surface is orientable if it is possible to define on it a
nowhere-vanishing normal vector field.

12 Actually, we should also show that it is Hausdorff with a countable cover, but this follows automatically by 
equipping S 2 with the subspace topology. 
Figure 3.4 Stereographic projection.

(See Figure 3.4.) Observe that the upper hemisphere projects onto points on the plane
outside the sphere while the lower hemisphere projects to points on the plane inside
the sphere. This is called stereographic projection from $N$. Let $S$ be the south pole,
and define $(V, \psi)$ similarly via stereographic projection from $S$.$^{13}$

Define coordinates $(u^1, u^2)$ such that $Q = \varphi(P) = \varphi(x_P, y_P, z_P) = (u^1, u^2, 0)$.
The line from $N$ to $Q$ is described parametrically by $N + t(Q - N)$, so to find the
value of $t$ corresponding to the point $P$ we write

$$(x_P, y_P, z_P) = (0, 0, 1) + t(u^1, u^2, -1) = (tu^1, tu^2, 1 - t)$$

and solve for $t$. The point $P$ is on the sphere, so the sum of the squares of the
coordinates is unity, which leads to

$$t = \frac{2}{1 + \eta},$$

where $\eta := (u^1)^2 + (u^2)^2$. We thus obtain

$$x_P = \frac{2u^1}{1 + \eta}, \qquad y_P = \frac{2u^2}{1 + \eta}, \qquad \text{and} \qquad z_P = \frac{\eta - 1}{\eta + 1}.$$

The patch $U$ is defined by the range $-\infty < u^i < \infty$, $i = 1, 2$. The lower hemisphere
is mapped to the disk $\eta < 1$ while the upper hemisphere is mapped to the points
with $\eta > 1$.


13 Stereographic projection is often defined from the sphere onto the tangent plane at the south pole but, as
we wish to consider projections from the south and north pole simultaneously, the median plane is more 
convenient here. 
Projecting from the south pole instead, we coordinatize $V$ by pairs $(v^1, v^2)$, defined
so that $Q = \psi(P) = \psi(x_P, y_P, z_P) = (v^1, v^2, 0)$. Repeating the argument above
gives

$$x_P = \frac{2v^1}{1 + \xi}, \qquad y_P = \frac{2v^2}{1 + \xi}, \qquad \text{and} \qquad z_P = \frac{1 - \xi}{1 + \xi},$$

where $\xi := (v^1)^2 + (v^2)^2$.

To find the coordinate transformation valid on the overlap $\varphi(U \cap V)$ we start from
$(u^1, u^2)$, map backwards via $\varphi^{-1}$ to the sphere to get $(x_P, y_P, z_P)$, and then map
down via $\psi$ to get $(v^1, v^2)$. The way to do this is just to equate $(x_P, y_P, z_P)$ in the
two coordinate systems and carry out a bit of tedious algebra until you discover that

$$(v^1(u^1, u^2), v^2(u^1, u^2)) = (\psi \circ \varphi^{-1})(u^1, u^2) = (u^1, u^2)/\eta.$$

The Jacobian of the transformation $\psi \circ \varphi^{-1}$ is therefore

$$\frac{\partial(v^1, v^2)}{\partial(u^1, u^2)} = -\frac{1}{\eta^2}$$

on the overlap $\varphi(U \cap V)$. Evidently this is nowhere vanishing. (Also, it does not
blow up, because the two patches do not overlap at the south pole where $\eta = 0$.)
Hence, by the inverse function theorem, $\psi \circ \varphi^{-1} : \varphi(U \cap V) \to \psi(U \cap V)$ is a
diffeomorphism, so $S^2$ is a smooth manifold. As the Jacobian is everywhere of one
sign, $S^2$ is orientable.$^{14}$
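The "tedious algebra" for the transition function can be spot-checked numerically. The sketch below is only an illustration under stated assumptions: `phi_inverse` encodes the formulas for $(x_P, y_P, z_P)$ in terms of $(u^1, u^2)$ given in the example, while the closed form used in `psi` (projection from the south pole, $v^i = x^i/(1 + z)$) is our own inversion of the south-pole formulas and does not appear in the text. The function names and step size are hypothetical.

```python
import math

def phi_inverse(u1, u2):
    """Plane point (u1, u2) -> point of the sphere, inverting projection from N."""
    eta = u1 * u1 + u2 * u2
    return (2 * u1 / (1 + eta), 2 * u2 / (1 + eta), (eta - 1) / (eta + 1))

def psi(x, y, z):
    """Projection from S = (0, 0, -1) onto the plane z = 0 (requires z != -1)."""
    return (x / (1 + z), y / (1 + z))

def transition(u1, u2):
    """The transition function psi o phi^{-1}, defined away from (0, 0)."""
    return psi(*phi_inverse(u1, u2))

def jacobian_det(u1, u2, h=1e-6):
    """Central-difference estimate of det d(v1, v2)/d(u1, u2)."""
    a_p, a_m = transition(u1 + h, u2), transition(u1 - h, u2)
    b_p, b_m = transition(u1, u2 + h), transition(u1, u2 - h)
    dv1_du1 = (a_p[0] - a_m[0]) / (2 * h)
    dv2_du1 = (a_p[1] - a_m[1]) / (2 * h)
    dv1_du2 = (b_p[0] - b_m[0]) / (2 * h)
    dv2_du2 = (b_p[1] - b_m[1]) / (2 * h)
    return dv1_du1 * dv2_du2 - dv1_du2 * dv2_du1

u1, u2 = 0.6, -1.3
eta = u1 * u1 + u2 * u2
v1, v2 = transition(u1, u2)      # expected to equal (u1, u2) / eta
det = jacobian_det(u1, u2)       # expected to be close to -1 / eta**2
```

At the sample point the transition map agrees with $(u^1, u^2)/\eta$ and the estimated determinant is negative, consistent with the remark in the footnote about reordering $v^1$ and $v^2$.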


Example 3.7 If you take a rectangular strip of paper and bend it around until the
left and right edges meet then glue the edges together, you get a cylinder. But if you
hold the left side and twist the right side through 180 degrees, then bend it around
and glue the ends together, you get a Möbius strip (Figure 3.5). This process of
gluing together edges is formally known as identifying the edges. Mathematically,
the Möbius strip can be viewed as the quotient space $(\mathbb{R} \times [-1, 1])/\sim$, where we have
$(x, y) \sim (x + 1, -y)$. (The interval must be symmetric about zero for the identification $y \mapsto -y$ to make sense.) If you substitute $\mathbb{R}$ for the closed interval $[-1, 1]$ in the definition,
you get the (infinite) Möbius band $B$. We claim that $B$ is a two-dimensional
nonorientable smooth manifold.$^{15}$

Let $[x, y]$ denote the equivalence class of $(x, y)$ in $B$. It’s a bit less confusing if we
cover $B$ with three patches rather than the traditional two. Let’s choose

$U_1 = \{[x, y] : 0 < x < 2/3,\ y \in \mathbb{R}\}$,

$U_2 = \{[x, y] : 1/3 < x < 1,\ y \in \mathbb{R}\}$,

$U_3 = \{[x, y] : 2/3 < x < 4/3,\ y \in \mathbb{R}\}$.

14 If we were to list the coordinates $v^1$ and $v^2$ in the opposite order we could force the Jacobian to be positive
on the overlap. All that we require is the same sign on every overlap.

15 So far all our manifolds have been without boundary. The Möbius strip is an example of a (two-dimensional
smooth nonorientable) manifold with boundary. For more information about those sorts of things you’ll need
to read Chapter 6.
Figure 3.5 (a) A strip of paper. (b) A cylinder. (c) A Möbius strip.

Define coordinate maps $\varphi_i : U_i \to \mathbb{R}^2$ by $\varphi_i([x, y]) = (x, y)$, where $(x, y)$ is
the unique representative of $[x, y]$ in the parameter range corresponding to $U_i$. For
example, $\varphi_1([1/3, y]) = (1/3, y)$ but $\varphi_1([4/3, y]) = \varphi_1([1/3, -y]) = (1/3, -y)$.

Now consider the transition maps. On $U_1 \cap U_2$ we have $(\varphi_1 \circ \varphi_2^{-1})(x, y) =
(x, y)$, because $\varphi_1([x, y]) = \varphi_2([x, y])$. For example, $\varphi_1([1/2, y]) = (1/2, y) =
\varphi_2([1/2, y])$. The same thing happens on $U_2 \cap U_3$. In particular, both these transition
maps have Jacobians equal to 1. But on $U_1 \cap U_3$ something unexpected happens. For
example, $\varphi_1^{-1}(1/6, y) = [1/6, y] = [7/6, -y]$ so $(\varphi_3 \circ \varphi_1^{-1})(1/6, y) = (7/6, -y)$.
In general, $(\varphi_3 \circ \varphi_1^{-1})(x, y) = (x + 1, -y)$. Observe that the Jacobian of this
transformation is

$$\begin{vmatrix} \partial(x + 1)/\partial x & \partial(x + 1)/\partial y \\ \partial(-y)/\partial x & \partial(-y)/\partial y \end{vmatrix} = \begin{vmatrix} 1 & 0 \\ 0 & -1 \end{vmatrix} = -1.$$

All the Jacobians are nonvanishing, so all the transition maps are diffeomorphisms. 
Hence B is a smooth manifold. But the Jacobians have different signs, so we cannot 
conclude anything about orientability. By thinking about this example a bit harder, 
though, you should be able to convince yourself that you will always get a sign 
mismatch for any set of coordinate charts. (Basically the reason is that any covering 
has to include an open set that contains points of the form [0, y], and on these the 
Jacobian always flips sign.) Hence the Möbius strip is not orientable. (For a slightly
more rigorous proof, see [27], Section 2.8.) 
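The bookkeeping with representatives in Example 3.7 is easy to get wrong by hand, so here is a small illustrative sketch of the three coordinate maps and their transition functions. The names `RANGES`, `chart` and `transition` are hypothetical, and the code assumes only the identification $(x, y) \sim (x + 1, -y)$ and the three parameter ranges chosen in the example.

```python
import math

# Parameter ranges for x on the three patches U_1, U_2, U_3.
RANGES = {1: (0.0, 2 / 3), 2: (1 / 3, 1.0), 3: (2 / 3, 4 / 3)}

def chart(i, x, y):
    """phi_i([x, y]): the representative of the class whose x lies in range i."""
    lo, hi = RANGES[i]
    k = math.floor(x - lo)            # integer shift bringing x into [lo, lo + 1)
    x_rep = x - k
    if not lo < x_rep < hi:
        raise ValueError("the class [x, y] does not lie in this patch")
    y_rep = -y if k % 2 else y        # each unit shift in x flips the sign of y
    return (x_rep, y_rep)

def transition(i, j, x, y):
    """(phi_j o phi_i^{-1})(x, y) for (x, y) in the parameter range of U_i."""
    lo, hi = RANGES[i]
    if not lo < x < hi:
        raise ValueError("(x, y) is not in the image of phi_i")
    return chart(j, x, y)

t12 = transition(1, 2, 0.5, 0.4)      # identity on U_1 and U_2
t23 = transition(2, 3, 0.8, 0.4)      # identity on U_2 and U_3
t13 = transition(1, 3, 1 / 6, 0.4)    # the shift-and-flip (x + 1, -y)
```

Only the third transition reverses the sign of $y$, which is the source of the Jacobian $-1$ computed in the example.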

EXERCISE 3.16 Let $\mathbb{Z}^n$ be the integer lattice in $\mathbb{R}^n$, namely the set of all points
whose coordinates are integers. The n-torus $T^n$ is defined to be the quotient space
$T^n = \mathbb{R}^n/\mathbb{Z}^n$. For example, the 2-torus $T^2$ is the set of all points $(x, y) \in \mathbb{R}^2$ such
that $(x + 1, y) \sim (x, y) \sim (x, y + 1)$. Equivalently, it is the set of all points in the unit
square with outside boundary points identified as follows. First identify the top and
bottom boundaries of the unit square along the arrows to get a cylinder, then identify
the left and right boundaries along the arrows to get the 2-torus (a doughnut). (See
Figure 3.6.)
Figure 3.6 The 2-torus obtained via identifications.

Show that $T^2$ is an orientable manifold. Hint: You may assume that $T^2$ is Hausdorff
with a countable cover. (If, however, you are itching to prove this, you may want to
revisit the notion of a quotient topology.) By analogy with Example 3.7, cover $T^2$
with nine patches. For instance, take

$U_1 = \{[x, y] : 0 < x < 2/3,\ 0 < y < 2/3\}$,

$U_2 = \{[x, y] : 0 < x < 2/3,\ 1/3 < y < 1\}$,

$U_3 = \{[x, y] : 0 < x < 2/3,\ 2/3 < y < 4/3\}$,

$U_4 = \{[x, y] : 1/3 < x < 1,\ 0 < y < 2/3\}$,

etc., together with the analogous coordinate maps.


EXERCISE 3.17 Real projective n-space $\mathbb{RP}^n$ consists of the set of all lines through
the origin in $\mathbb{R}^{n+1}$. If $x$ is a nonzero vector in $\mathbb{R}^{n+1}$, let $[x]$ denote the line determined
by $x$. Clearly, $[x] = [y]$ if and only if $x = \lambda y$ for some $\lambda \neq 0$. Let
$U_i = \{[x^0, x^1, \dots, x^n] : x^i \neq 0\} \subset \mathbb{RP}^n$ be an open subset of $\mathbb{RP}^n$ and, for
$0 \le i \le n$, define coordinate maps $\varphi_i : U_i \to \mathbb{R}^n$ by

$$\varphi_i([x^0, x^1, \dots, x^n]) = \left(\frac{x^0}{x^i}, \dots, \frac{x^{i-1}}{x^i}, \frac{x^{i+1}}{x^i}, \dots, \frac{x^n}{x^i}\right).$$

Show that the pairs $(U_i, \varphi_i)$ define a set of compatible coordinate neighborhoods,
thereby making $\mathbb{RP}^n$ a (smooth) n-dimensional manifold. What can you say about
the orientability of $\mathbb{RP}^n$? Remark: The tricky part about this problem is finding the
correct inverse maps. Technically, you should really show that $\mathbb{RP}^n$ is Hausdorff with
a countable cover in the natural (quotient) topology, but you can skip this part.


3.5 Smooth maps on manifolds 

The definition of the derivative given in Section 3.2 uses the linear structure of 
Euclidean space in a crucial way, and there is no way to define something similar 
for a general curved space. Instead, we use the existence of the coordinate maps on 
a smooth manifold to define what we mean by differentiability. 
Specifically, a function $f : M \to \mathbb{R}$ is smooth at $p \in M$ if, for any chart $(U, \varphi)$
with $p \in U$, the map

$$\tilde{f} := f \circ \varphi^{-1} : \varphi(U) \to \mathbb{R} \tag{3.11}$$

is a smooth function in the usual Euclidean sense near $\varphi(p)$. This definition is
independent of coordinate chart by the very definition of a manifold for, if $(V, \psi)$ is
some other chart with $p \in V$ then, as $\psi \circ \varphi^{-1}$ is smooth, $f \circ \varphi^{-1} = f \circ \psi^{-1} \circ \psi \circ \varphi^{-1}$
is smooth if and only if $f \circ \psi^{-1}$ is smooth. The space of smooth functions on $M$ is
denoted $\Omega^0(M)$. Once again, if $x^1, \dots, x^n$ are local coordinates near $p$, we abuse
notation and write $f = f(x^1, \dots, x^n)$.

The same idea applies to more general maps. Let $M$ and $N$ be two smooth manifolds
of dimensions $m$ and $n$, respectively. A map $f : M \to N$ is smooth if for
every $p \in M$ there exist charts $(U, \varphi)$ on $M$ and $(V, \psi)$ on $N$, with $p \in U$ and
$f(U) \subset V$ such that

$$\tilde{f} := \psi \circ f \circ \varphi^{-1} : \varphi(U) \to \psi(V) \tag{3.12}$$

is a smooth map of Euclidean spaces. A smooth map $f : M \to N$ is a diffeomorphism
if $f^{-1}$ exists and is smooth, in which case we say that $M$ and $N$ are
diffeomorphic.

If $x^1, \dots, x^m$ are local coordinates on $U$ and $y^1, \dots, y^n$ are local coordinates
on $V$ then we should write

$$\tilde{f}(x^1, \dots, x^m) = (\tilde{f}^1(x^1, \dots, x^m), \dots, \tilde{f}^n(x^1, \dots, x^m)), \tag{3.13}$$

where $\tilde{f}^i = y^i \circ \tilde{f}$, but once again we adopt the conventional but sloppy notation
and write instead$^{16}$

$$f(x^1, \dots, x^m) = (y^1(x^1, \dots, x^m), \dots, y^n(x^1, \dots, x^m)). \tag{3.14}$$

Writing it in this way makes it appear that $f : M \to N$ is an ordinary multivariable
function on Euclidean space and in a sense it is, via the coordinate maps in
a neighborhood of a point. For this reason we will use the notation (3.14) in all

16 Although (3.10) and (3.14) look similar, they have different meanings. The former is a change of coordinates 
between two coordinate neighborhoods of the same point on M, whereas the latter is a map between a 
neighborhood of M and a neighborhood of N. But there is a sense in which they can be considered to be 
equivalent, which is related to the notion of active and passive transformations. The idea is illustrated in 
Figure 3.7. Let M be a disk in the plane, equipped with the usual Cartesian coordinates. According to the 
active point of view, we leave the coordinate system alone but deform the disk until it looks like $N$. A point
$p \in M$ with coordinates $(a, b)$ is mapped to a point $q \in N$ with coordinates $(c, d)$. According to the passive
point of view, we leave the points of $M$ alone, but deform the coordinate system so that the coordinates of
$p$ change from $(a, b)$ to $(c, d)$. As with optical illusions either viewpoint is legitimate, but keeping both in
your mind at once will drive you crazy. The problem stems from the use of coordinates, so it is best to avoid 
their use whenever possible. Unfortunately, one does not always have this luxury. To avoid confusion from 
the start, you are generally better off distinguishing the interpretations of (3.10) and (3.14). 
Figure 3.7 Active and passive interpretations of a diffeomorphism.

that follows, remembering that it is only true locally, and that it employs various 
notational liberties. 

Remark Because smooth maps of manifolds reduce to smooth maps of 
Euclidean space, the inverse function theorem works for manifolds in the same 
way as it does in Euclidean space: if the Jacobian matrix of $f$ is nonsingular then
$f$ is a local diffeomorphism. In particular, $M$ and $N$ can only be diffeomorphic if
m = n. 


Remark In multivariable calculus one defines curves and surfaces in $\mathbb{R}^3$ by
means of a parameterization. For example, a curve is a map $\gamma : I \to \mathbb{R}^3$,
where $I$ is an open subset of $\mathbb{R}$, given by $\gamma(t) = (x^1(t), x^2(t), x^3(t))$. A surface
is a map $\sigma : U \to \mathbb{R}^3$, where $U$ is an open subset of $\mathbb{R}^2$, given by
$\sigma(u, v) = (x^1(u, v), x^2(u, v), x^3(u, v))$. A coordinate map of a manifold can be
thought of as a kind of inverse parameterization. More precisely, if $U$ is an open
subset of $\mathbb{R}^n$ and $\psi : U \to M$ is a diffeomorphism onto an open set $V \subset M$ then
$(U, \psi)$ is a parameterization of $V$ and $(V, \psi^{-1})$ is a coordinate chart.

3.6 Immersions and embeddings 

Let $M$ and $N$ be smooth manifolds of dimensions $m$ and $n$, respectively, and let
$f : M \to N$ be a smooth map, represented in local coordinates by (3.14). If $m \le n$
and the Jacobian matrix $(\partial y^i/\partial x^j)$ of the transformation (3.14) has maximal rank
(namely $m$) at $p \in M$ then $f$ is called an immersion at $p$, and if $f$ is an immersion
for all $p \in M$ then $M$ is an (immersed) submanifold of $N$. The definition
makes sense, because if the Jacobian has maximal rank then $f(M)$ is locally coordinatizable
according to the inverse function theorem. If $m \ge n$ and the Jacobian
of the transformation has maximal rank (namely $n$) at $p \in M$ then $f$ is called a
submersion at $p$.

Figure 3.8 (a) An immersed circle. (b) An embedded circle.

An injective immersion $f$ is called an embedding (or imbedding), provided
that $f$ maps $M$ homeomorphically onto its image $f(M)$ (in the induced topology).
The basic difference between immersions and embeddings is that the image
of an immersion can have self-intersections whereas the image of an embedding 
cannot. (See Figure 3.8.) The celebrated Whitney embedding theorem says that
any n-dimensional topological manifold can be embedded in $\mathbb{R}^{2n+1}$, and any
n-dimensional smooth manifold can be embedded in $\mathbb{R}^{2n}$. For example, the Klein
bottle is a smooth 2-manifold obtained in the same manner as the torus in
Exercise 3.16 except that the arrows on the right and at the top of the square are reversed
prior to making the identifications. If you try to construct the Klein bottle in $\mathbb{R}^3$ you
will observe that you cannot do it without creating self-intersections somewhere
(Figure 3.9). But, thanks to Whitney’s theorem, we know that it can be embedded
in $\mathbb{R}^4$. Of course, the dimension $2n$ is only an upper bound. For example, we know
that the 2-sphere can be embedded into $\mathbb{R}^3$.

Whitney’s result means that, without loss of generality, we could simply treat 
manifolds as living in a large Euclidean space. The reason we do not do this is 
primarily aesthetic. Mathematicians strive for parsimony when making definitions, 
and it is inelegant to define an object with reference to something outside itself. 
The intrinsic description of a manifold also has physical utility. Einstein modeled 
the universe as a smooth manifold of a certain type. By definition, the universe 
is everything. What could it possibly mean to embed the universe in something 
outside itself? 

We can use the ideas of this section to write down a very handy criterion for
obtaining manifolds from maps. Suppose that $f : M \to N$ with $m \ge n$. If $f$
is a submersion at $p \in M$ then $p$ is called a regular point of $f$, otherwise it is
a critical point of $f$. A point $q \in N$ is a regular value of $f$ if every point in
$f^{-1}(q)$ is regular.$^{17}$ By famous theorems of Brown and Sard, the regular values of


17 This means that $q$ is automatically a regular value if $f^{-1}(q)$ is empty.




Differentiation on manifolds 



Figure 3.9 A Klein bottle immersed in R 3 . (Source: Wikipedia) 

any smooth function $f$ are dense in $N$, so in this sense almost all the points of $N$
are regular values.

Theorem 3.2 (Regular value theorem) Let $f : M \to N$ be a smooth map of
manifolds, with $\dim M = m$ and $\dim N = n$. Let $m \ge n$ and let $q \in N$ be
a regular value of $f$. Then $f^{-1}(q)$ is a smooth embedded submanifold of $M$ of
dimension $m - n$.

Proof Let $y^1, \dots, y^n$ be local coordinates in a neighborhood $V$ of $q$, with $y^i(q) = 0$.
In $U = f^{-1}(V)$, consider the functions $f^i := y^i \circ f$ for $1 \le i \le n$. The first
claim is that $f^1, \dots, f^n$ can be completed to a set of good coordinates on an open
subset of $U$. To do this, let $x^1, \dots, x^m$ be good coordinates in a neighborhood
of $p \in U$, and consider the entire set $\{f^1, \dots, f^n, x^1, \dots, x^m\}$. By hypothesis
the Jacobian $[\partial f^i/\partial x^j]_{1 \le i \le n,\ 1 \le j \le m}$ has rank $n$ so, by linear algebra, some $n \times n$
minor of the Jacobian matrix is nonzero. Without loss of generality, suppose
it is the principal minor indexed by the first $n$ columns. Discarding $x^1, \dots, x^n$
from our list we are left with the set $\{f^1, \dots, f^n, x^{n+1}, \dots, x^m\}$. The Jacobian of
the coordinate transformation $\{x^1, \dots, x^m\} \to \{f^1, \dots, f^n, x^{n+1}, \dots, x^m\}$, which
we may write as

$$\begin{bmatrix} \partial f^i/\partial x^j \\ \partial x^k/\partial x^j \end{bmatrix}_{1 \le i \le n,\ n+1 \le k \le m,\ 1 \le j \le m},$$

is then nonzero, so by
the inverse function theorem the coordinate transformation is a diffeomorphism
(possibly on some smaller neighborhood $U'$ of $p$), and the first claim is proved.

The second claim is that a subset of these coordinates provides good coordinates
for $f^{-1}(q)$. Specifically, note that for all $p \in U' \cap f^{-1}(q)$ we have $f^i(p) = 0$. The map
$p \mapsto (0, \dots, 0, x^{n+1}(p), \dots, x^m(p))$ is manifestly a homeomorphism from $U' \cap f^{-1}(q)$ onto its
image in $\mathbb{R}^{m-n}$. It remains only to show that the transition functions for charts of
this form are diffeomorphisms. So, let $U''$ be some other open set in $M$ containing
$p$, with coordinates $(g^1, \dots, g^n, z^{n+1}, \dots, z^m)$ constructed as above, and consider
the coordinate transformation represented symbolically by $(f, x) \mapsto (g, z)$. The
Jacobian determinant of this map can also be written symbolically, as

$$\begin{vmatrix} \partial g/\partial f & \partial g/\partial x \\ \partial z/\partial f & \partial z/\partial x \end{vmatrix},$$

and it is nonzero by the first claim. But the $g^i$ are different coordinate functions
representing the map $f$, so everywhere on $f^{-1}(q)$ we have $g^i(0, x) = 0$. In particular,
$\partial g^i/\partial x^j = 0$ on $f^{-1}(q)$. It follows that $\det(\partial z/\partial x) \neq 0$, so the second claim
is proved. □

Remark As points are closed in a Hausdorff space, $f^{-1}(q)$ is a closed subset of
$M$.
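To see the distinction between regular and critical values in a concrete case, consider the function $f(x, y) = x^2 - y^2$ on $\mathbb{R}^2$. This example is ours and is not taken from the text; the sketch below simply checks the rank condition of the theorem at sample points. The level set $f^{-1}(1)$ is a hyperbola (a one-dimensional manifold, as the theorem predicts), while $f^{-1}(0)$ is a pair of crossing lines that contains the critical point at the origin.

```python
import math

def f(x, y):
    """A sample smooth map R^2 -> R (an assumed example, not from the text)."""
    return x * x - y * y

def grad_f(x, y):
    """The 1 x 2 Jacobian matrix of f at (x, y)."""
    return (2 * x, -2 * y)

def is_regular_point(x, y):
    """f is a submersion at (x, y) exactly when its 1 x 2 Jacobian has rank 1."""
    return any(component != 0 for component in grad_f(x, y))

# Sample points on the level set f = 1, parameterized by (cosh t, sinh t).
level_one = [(math.cosh(t), math.sinh(t)) for t in (-2.0, -0.5, 0.0, 0.5, 2.0)]
one_is_regular_on_samples = all(is_regular_point(x, y) for x, y in level_one)

# The origin lies on the level set f = 0 and is a critical point,
# so 0 is not a regular value of f.
origin_is_critical = not is_regular_point(0.0, 0.0)
```

A finite sample cannot prove that 1 is a regular value, of course; here that follows because the gradient $(2x, -2y)$ vanishes only at the origin, which does not lie on $f^{-1}(1)$.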


EXERCISE 3.18 Using Theorem 3.2, show that the n-dimensional sphere $S^n$ is a
manifold.
To motivate what follows, let us return to Euclidean space for a moment. Let $\psi$
be a scalar field on $\mathbb{R}^3$ and let $\gamma : I \to \mathbb{R}^3$ be a parameterized curve given by
$t \mapsto (\gamma^1(t), \gamma^2(t), \gamma^3(t)) = (x^1(t), x^2(t), x^3(t))$. Writing $(x, y, z)$ for $(x^1, x^2, x^3)$, we have

$$\frac{d}{dt}\psi(\gamma(t)) = \frac{\partial\psi}{\partial x}\frac{dx}{dt} + \frac{\partial\psi}{\partial y}\frac{dy}{dt} + \frac{\partial\psi}{\partial z}\frac{dz}{dt} = \frac{d\gamma}{dt} \cdot \nabla\psi, \tag{3.15}$$

where $\nabla$ is the gradient operator $(\partial/\partial x, \partial/\partial y, \partial/\partial z)$ and $d\gamma/dt$ is the tangent vector
to the curve. Equation (3.15) is the directional derivative of $\psi$ in the direction of
$d\gamma/dt$. It tells us how $\psi$ changes as we move along the curve. But $\psi$ is arbitrary;
if we were to strip away $\psi$ from both sides of (3.15) we would get

$$\frac{d}{dt} = \frac{d\gamma}{dt} \cdot \nabla. \tag{3.16}$$


Equation (3.16) gives us a clue how the idea of a tangent vector can be gener- 
alized from Euclidean space to a general manifold. We cannot situate arrows or 
straight lines inside an arbitrary manifold (because a general manifold is not a vec- 
tor space), but we can retain the essence of a tangent vector as something that tells 
you how a function changes as you move in that direction. So instead of $d\gamma/dt$,
which lives in Euclidean space, we now view $d/dt$ itself as the tangent vector to
the curve. This perspective can be a bit confusing at first, because $\gamma$ seems to have
disappeared altogether. Nevertheless, $\gamma$ is still there because $d/dt$ makes no sense
as a tangent vector in the absence of a curve.$^{18}$


18 For a detailed explanation of the correct way to understand $d/dt$, see the discussion of integral curves in
Section 3.16.




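Example 3.8 As a simple illustration of the utility of this perspective, consider
again the transformation (3.6) between Cartesian and polar coordinates in the plane.
By considering the effect of the gradient operator we see that the vector $\partial/\partial x$ (respectively,
$\partial/\partial y$) points in the direction of increase of the $x$ (respectively, $y$) coordinate.
We set the length of each vector to unity and therefore write

$$e_x = \frac{\partial}{\partial x} \qquad \text{and} \qquad e_y = \frac{\partial}{\partial y}$$

for the usual orthonormal basis of $\mathbb{R}^2$.

From (3.6) and the chain rule we compute

$$\frac{\partial}{\partial r} = \frac{\partial x}{\partial r}\frac{\partial}{\partial x} + \frac{\partial y}{\partial r}\frac{\partial}{\partial y} = \cos\theta\,\frac{\partial}{\partial x} + \sin\theta\,\frac{\partial}{\partial y},$$

$$\frac{\partial}{\partial\theta} = \frac{\partial x}{\partial\theta}\frac{\partial}{\partial x} + \frac{\partial y}{\partial\theta}\frac{\partial}{\partial y} = -r\sin\theta\,\frac{\partial}{\partial x} + r\cos\theta\,\frac{\partial}{\partial y}.$$

The vector $\partial/\partial r$ evidently has unit length, so it must be the unit vector $e_r$ in the $r$
direction. The vector $\partial/\partial\theta$ points in the $\theta$ direction (i.e., the direction in which $\theta$
increases), but it has length $r$, so it must be divided by $r$ to yield a unit vector. Thus we have

$$e_r = \cos\theta\, e_x + \sin\theta\, e_y,$$
$$e_\theta = -\sin\theta\, e_x + \cos\theta\, e_y.$$

The reader should verify that these are indeed the correct unit vectors. (See
Figure 3.10.)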
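The verification requested at the end of Example 3.8 can be mirrored numerically. The following sketch (function names are ours) stores the Cartesian components of $\partial/\partial r$ and $\partial/\partial\theta$ as read off from the chain-rule expressions, and checks their lengths and orthogonality at a sample point.

```python
import math

def d_dr(r, theta):
    """Cartesian components of the coordinate vector d/dr."""
    return (math.cos(theta), math.sin(theta))

def d_dtheta(r, theta):
    """Cartesian components of the coordinate vector d/dtheta."""
    return (-r * math.sin(theta), r * math.cos(theta))

def norm(v):
    return math.hypot(v[0], v[1])

def dot(v, w):
    return v[0] * w[0] + v[1] * w[1]

r, theta = 2.5, 1.1
length_r = norm(d_dr(r, theta))              # unit length
length_theta = norm(d_dtheta(r, theta))      # length r, so divide by r to normalize
e_theta = tuple(c / r for c in d_dtheta(r, theta))
orthogonal = abs(dot(d_dr(r, theta), e_theta)) < 1e-12
```

The coordinate vector $\partial/\partial\theta$ comes out with length $r$ rather than 1, which is why the factor of $r$ is removed in passing to $e_\theta$.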


Figure 3.10 Cartesian and polar unit vectors.

EXERCISE 3.19 The coordinate transformation from spherical polar to Cartesian
coordinates is given by

$$x = r\sin\theta\cos\phi, \qquad y = r\sin\theta\sin\phi, \qquad z = r\cos\theta.$$

By analogy with the previous example, show that the unit vectors in spherical polar
coordinates are given by

$$e_r = \sin\theta\cos\phi\, e_x + \sin\theta\sin\phi\, e_y + \cos\theta\, e_z,$$
$$e_\theta = \cos\theta\cos\phi\, e_x + \cos\theta\sin\phi\, e_y - \sin\theta\, e_z,$$
$$e_\phi = -\sin\phi\, e_x + \cos\phi\, e_y.$$

To extend these ideas to a general manifold $M$, we define a tangent vector $X_p$
at a point $p \in M$ to be a linear derivation at $p$. This means that, for $a, b \in \mathbb{R}$ and
$f, g \in \Omega^0(M)$, the map $X_p : \Omega^0(M) \to \mathbb{R}$ satisfies

(1) linearity: $X_p(af + bg) = aX_p(f) + bX_p(g)$, and

(2) the Leibniz property: $X_p(fg) = g(p)X_p(f) + f(p)X_p(g)$.
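The two axioms can be illustrated in $\mathbb{R}^2$, where the directional derivative at a point along a fixed vector is the prototypical derivation. The sketch below is an illustration only: it approximates the derivative by central differences, and the sample functions, point, direction, and the name `tangent_vector` are our own choices.

```python
import math

def tangent_vector(p, v, h=1e-6):
    """Return X_p, the directional derivative at p along v (central difference)."""
    def X_p(func):
        forward = func(*[pi + h * vi for pi, vi in zip(p, v)])
        backward = func(*[pi - h * vi for pi, vi in zip(p, v)])
        return (forward - backward) / (2 * h)
    return X_p

def f(x, y):
    return x * x * y

def g(x, y):
    return math.sin(x) + y

def fg(x, y):
    return f(x, y) * g(x, y)

p, v = (1.0, 2.0), (3.0, -1.0)
X_p = tangent_vector(p, v)

# Leibniz property: X_p(fg) = g(p) X_p(f) + f(p) X_p(g), up to discretization error.
lhs = X_p(fg)
rhs = g(*p) * X_p(f) + f(*p) * X_p(g)
```

The two sides agree to within the accuracy of the finite-difference approximation; with exact derivatives the identity is just the product rule.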

The vector space $T_pM$ generated by all the $X_p$ is called the tangent space to
$M$ at $p$. Although the tangent space $T_pM$ is an abstract linear space attached to
M at p, we often imagine it to look like the picture in Figure 3.11, because this 
agrees with our intuitions for a manifold embedded in Euclidean space. Still, the 
picture is somewhat misleading (because manifolds are defined independently of 
any embedding); a better way to think of the tangent space is in terms of something 
called the “tangent bundle”, which we will describe in Example 7.3. 

Figure 3.11 The idea of the tangent space at a point.

Theorem 3.3 Let $x^1, \dots, x^n$ be local coordinates in a neighborhood $U$ of a
point $p \in M$. Then $T_pM$ is spanned by the tangent vectors $\partial/\partial x^1, \dots, \partial/\partial x^n$.
In particular, $\dim T_pM = n$.

Proof Without loss of generality we may assume that $(x^1(p), \dots, x^n(p)) =
(0, \dots, 0) = 0$. Let $X_p$ be a tangent vector at $p$, and define $X^i := X_p(x^i)$. As
discussed in Section 3.5, in $U$ the smooth map $f : M \to \mathbb{R}$ can be written as

$$f = f(x^1, \dots, x^n),$$

which means that we can apply Taylor’s theorem with remainder. Writing $x =
(x^1, \dots, x^n)$ we have, in the neighborhood of $p$,

$$f(x) = f(0) + \sum_i x^i g_i(x) \qquad \text{with} \qquad g_i(0) = \left.\frac{\partial f}{\partial x^i}\right|_0. \tag{3.17}$$

As usual, we identify the constant $f(0)$ with the constant function $f(0) \cdot 1$, where 1
is the unit function. The tangent vector axioms imply that $X_p$ acting on a constant
function vanishes,$^{19}$ so, applying $X_p$ to the equation on the left in (3.17) yields
(remembering that $x^i(p) = 0$)

$$X_p(f) = X_p(f(0)) + \sum_i X_p(x^i)\, g_i(0) + \sum_i x^i(0)\, X_p(g_i) = 0 + \sum_i X^i \left.\frac{\partial f}{\partial x^i}\right|_p + 0,$$

from which we conclude that

$$X_p = \sum_i X^i \left.\frac{\partial}{\partial x^i}\right|_p. \tag{3.18}$$

It follows that the tangent vectors $\partial/\partial x^i$ form a basis for $T_pM$ (they are linearly
independent because $\partial x^j/\partial x^i = \delta^j_i$), which is therefore
n-dimensional. The scalars $X^i$ are the components of $X_p$ in the coordinate system
$x^1, \dots, x^n$ (or, in the basis $\partial/\partial x^i$). □


Remark Applying Theorem 3.3 to $M = \mathbb{R}^n$ allows us to identify $T_p\mathbb{R}^n$ with $\mathbb{R}^n$
itself, because points and vectors are naturally identified in $\mathbb{R}^n$ (cf. Figure 3.10).


If $\{y^1, \dots, y^n\}$ is another set of local coordinates in a neighborhood of $p$,
Theorem 3.3 says that the $\partial/\partial y^j$ constitute an equally valid basis of $T_pM$.
According to the chain rule,

$$ \frac{\partial}{\partial y^j} = \sum_i \frac{\partial x^i}{\partial y^j} \frac{\partial}{\partial x^i}. \tag{3.19} $$

The key fact to remember from this result is that the change of basis matrix taking
us from one set of coordinate basis vectors to another is precisely the (inverse of
the) Jacobian matrix. The vector $X_p$ may be expanded in the $\partial/\partial y^j$ basis as

$$ X_p = \sum_j Y^j \frac{\partial}{\partial y^j}\bigg|_p \tag{3.20} $$

for some components $Y^j$. Substituting (3.19) into (3.18) and comparing with (3.20)
gives

$$ Y^j = \sum_i X^i \frac{\partial y^j}{\partial x^i}\bigg|_p. \tag{3.21} $$

19 Proof: By the Leibniz property applied to the unit functions $f = g = 1$,
$X_p(1) = X_p(1 \cdot 1) = 2X_p(1) \Rightarrow X_p(1) = 0$, so by linearity,
$X_p(c \cdot 1) = cX_p(1) = 0$ for any constant $c$.


A vector field $X$ on $M$ is a smooth map $p \mapsto X_p$. In terms of the local
coordinates $x^1, \dots, x^n$,

$$ X = \sum_i X^i(x) \frac{\partial}{\partial x^i}, \tag{3.22} $$

where the $X^i$ are now smooth functions, as opposed to constants. If $\{y^1, \dots, y^n\}$ is
another set of local coordinates, and if we write

$$ X = \sum_j Y^j(y) \frac{\partial}{\partial y^j}, \tag{3.23} $$

then the same argument as above gives

$$ Y^j(y(x)) = \sum_i X^i(x) \frac{\partial y^j}{\partial x^i}. \tag{3.24} $$


As the indices are starting to proliferate, we will henceforth employ the Einstein
summation convention whenever convenient. In particular, we write

$$ X = X^i \frac{\partial}{\partial x^i} $$

in lieu of (3.22) and

$$ Y^j = X^i \frac{\partial y^j}{\partial x^i} $$

instead of (3.24).
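The transformation law (3.24) can be tried out numerically. The following is a minimal sketch, assuming the plane with Cartesian coordinates $(x, y)$ and polar coordinates $(r, \theta)$ away from the origin; the function name `polar_components` and the sample point are illustrative choices, not notation from the text.

```python
import math

def polar_components(x, y, X):
    """Components of a tangent vector in the polar basis (d/dr, d/dtheta).

    X = (X^x, X^y) are the components in the Cartesian coordinate basis at
    the point (x, y), which must not be the origin. The result is
    Y^j = X^i * dy^j/dx^i, as in (3.24), with (y^1, y^2) = (r, theta).
    """
    r2 = x * x + y * y
    r = math.sqrt(r2)
    # Jacobian entries dy^j/dx^i for r = sqrt(x^2 + y^2), theta = atan2(y, x)
    dr_dx, dr_dy = x / r, y / r
    dth_dx, dth_dy = -y / r2, x / r2
    Y_r = X[0] * dr_dx + X[1] * dr_dy
    Y_theta = X[0] * dth_dx + X[1] * dth_dy
    return Y_r, Y_theta

# The rotation field X = -y d/dx + x d/dy has polar components (0, 1),
# i.e. it is d/dtheta.
x, y = 1.5, -0.5
print(polar_components(x, y, (-y, x)))
```

The radial component vanishes and the angular component is 1 (up to rounding), which is the statement that the generator of rotations is $\partial/\partial\theta$.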


EXERCISE 3.20 Let $X$ and $Y$ be two vector fields. The Lie bracket $[X, Y]$ of the
two vector fields is defined to be the differential operator

$$ [X, Y] := XY - YX. \tag{3.25} $$

(a) Show that the Lie bracket of two vector fields is a vector field. Hint: Just show
that $[X, Y]$ is a linear derivation.

(b) If, in local coordinates, 20

$$ X = X^i \frac{\partial}{\partial x^i} \quad\text{and}\quad Y = Y^j \frac{\partial}{\partial x^j}, $$

show that

$$ [X, Y] = \left( X^i \frac{\partial Y^j}{\partial x^i} - Y^i \frac{\partial X^j}{\partial x^i} \right) \frac{\partial}{\partial x^j}. \tag{3.26} $$

(c) Show that the Lie bracket of vector fields $X$, $Y$, and $Z$ obeys the Jacobi identity:

$$ [X, [Y, Z]] + [Y, [Z, X]] + [Z, [X, Y]] = 0. \tag{3.27} $$

Hint: Do not use local coordinates. Instead, expand each term as a sum of
products of the form $XYZ$ and cancel terms.

20 Warning: In (3.23) we wrote $Y^j$ for the components of the vector field $X$ in the
$\partial/\partial y^j$ basis. But in this exercise $Y^j$ denotes the components of the vector field
$Y$ in the $\partial/\partial x^j$ basis. The two $Y^j$’s mean very different things, so don't confuse them.

Remark: A Lie algebra is a vector space $\mathfrak{a}$ equipped with a binary operation
$[\cdot, \cdot] : \mathfrak{a} \times \mathfrak{a} \to \mathfrak{a}$ that (i) is bilinear, (ii) is antisymmetric, and
(iii) satisfies the Jacobi identity. The above exercise shows that the set of all vector
fields on a manifold forms a Lie algebra.
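For readers who like to experiment, the bracket can be evaluated numerically from the component expression $[X, Y]^j = X^i\,\partial Y^j/\partial x^i - Y^i\,\partial X^j/\partial x^i$. The sketch below is only an illustration: the helper `lie_bracket`, the central-difference step `h`, and the two sample fields are assumptions made for this example, and the derivatives are approximate.

```python
def lie_bracket(X, Y, p, h=1e-5):
    """Approximate components of [X, Y] at the point p.

    X and Y map a point (a tuple of floats) to a tuple of components in the
    coordinate basis d/dx^j. Partial derivatives are estimated by central
    differences with step h, so the result is accurate only up to O(h^2).
    """
    n = len(p)

    def partial(F, i):
        # Central-difference estimate of dF^j/dx^i at p, for every j.
        plus = list(p)
        minus = list(p)
        plus[i] += h
        minus[i] -= h
        F_plus, F_minus = F(tuple(plus)), F(tuple(minus))
        return [(F_plus[j] - F_minus[j]) / (2 * h) for j in range(n)]

    Xp, Yp = X(p), Y(p)
    dX = [partial(X, i) for i in range(n)]  # dX[i][j] = dX^j/dx^i
    dY = [partial(Y, i) for i in range(n)]  # dY[i][j] = dY^j/dx^i
    return [sum(Xp[i] * dY[i][j] - Yp[i] * dX[i][j] for i in range(n))
            for j in range(n)]

def rotation(p):
    """X = -y d/dx + x d/dy on the plane."""
    return (-p[1], p[0])

def translation(p):
    """Y = d/dx on the plane."""
    return (1.0, 0.0)

# For these fields [X, Y] = -d/dy, so the output is close to [0, -1].
print(lie_bracket(rotation, translation, (0.3, 0.7)))
```

Swapping the two arguments flips the sign of every component, which is the antisymmetry required of a Lie algebra.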

It is important to note that the tangent space $T_pM$ is defined independently of any
coordinate system. Thus, at each point we are free to pick any basis $\{e_i\}$, not just
a coordinate basis $\{\partial/\partial x^i\}$. Any smoothly varying basis on $M$ is called a frame
field. 21 Generally, frame fields do not exist everywhere on $M$; they clearly exist
locally, because $e_i = \partial/\partial x^i$ is an instance. In terms of a frame field $\{e_i\}$, we could
write a vector field $X$ as

$$ X = X^i e_i. \tag{3.28} $$

Note that, if $e_i \neq \partial/\partial x^i$ then the $X^i$ in (3.28) is not the same as $X(x^i)$.

Two important questions arise. First, when does a frame field exist globally on
$M$? Second, given a frame field $\{e_i\}$, can we find a local system of coordinates $\{x^i\}$
such that $e_i = \partial/\partial x^i$? We will return to these questions below. 22

If $\{e'_i\}$ is any other frame field then there is a nonsingular matrix $A$ (actually,
a matrix of smooth functions) relating the two frame fields. Right away we have
a notational problem, stemming from two competing but equally entrenched
conventions. We observed in Section 1.7 that if $A : V \to V$ is a change of basis map
then

$$ e'_j = e_i A^i{}_j. \tag{1.17} $$

The problem is that, if $\{e_i\}$ is a frame field then each $e_i$ is a tangent vector and
therefore a linear derivation. The matrix entries $A^i{}_j$ are functions, so the unwary
reader might read (1.17) as saying that $e_i$ acts on $A^i{}_j$ as a derivation, which is not
true. For example, if $e_i = \partial/\partial x^i$ and $e'_i = \partial/\partial y^i$ then, to maintain consistency with
(1.17), (3.19) must be written

$$ \frac{\partial}{\partial y^j} = \frac{\partial}{\partial x^i} \frac{\partial x^i}{\partial y^j}, \tag{3.29} $$

so that

$$ A^i{}_j = \frac{\partial x^i}{\partial y^j}. \tag{3.30} $$

However, from calculus we know that the $x$ derivatives do not act on the Jacobian
matrix in (3.29), so we just have to remember that the same convention holds in
(1.17).

21 It is also called a moving frame (in French, repère mobile), especially when restricted
to a curve. This terminology is due to Élie Cartan.

22 See Sections 7.1 and 8.1.1, respectively.

We could avoid this problem by writing all the derivations on the right. For
example, we could leave (3.19) alone and instead write (1.17) as

$$ e'_j = A^i{}_j e_i, \tag{3.31} $$

but then this messes up our shorthand notation (1.35), because it would conflict
with the standard convention (1.13) for matrix multiplication. One way out of this
latter difficulty would be to write (1.17) as

$$ e'_j = A_j{}^i e_i, \tag{3.32} $$

and some writers do this, but this conflicts with the standard convention that the
first index represents a row of the matrix so we shall avoid it. Instead, we will stick
with the standard linear algebra conventions (1.17) and (1.35) and just remember
that although $e_i$ is a derivation it does not act on the matrix elements $A^i{}_j$.


3.8 The cotangent space $T_p^*M$

The dual space to $T_pM$ is called the cotangent space and is denoted $T_p^*M$. In
local coordinates $\{x^i\}$ around $p$, $T_pM$ is spanned by the $n$ basis vectors $\partial/\partial x^i$. The
corresponding dual basis vectors are denoted $dx^i$. By definition we have

$$ \left\langle \frac{\partial}{\partial x^i}, dx^j \right\rangle = \delta_i^j. \tag{3.33} $$

Clearly, $\dim T_p^*M = n$. A general element $\alpha_p$ of $T_p^*M$ is a linear combination of
the basis elements:

$$ \alpha_p = a_i\, dx^i, \tag{3.34} $$

where the $a_i$ are constants. A (differential) 1-form or (smooth) covector field $\alpha$
on $M$ is a smooth assignment $p \mapsto \alpha_p$. In local coordinates around $p$ we have

$$ \alpha = a_i(x)\, dx^i, \tag{3.35} $$

where the $a_i(x)$ are smooth functions on $M$. If $\{y^j\}$ is another system of local
coordinates near $p$ then, as you can demonstrate in the next exercise,

$$ \alpha = b_j(y)\, dy^j \tag{3.36} $$

where

$$ b_j(y) = a_i(x(y)) \frac{\partial x^i}{\partial y^j}. \tag{3.37} $$

Equivalently,

$$ a_i(x) = b_j(y(x)) \frac{\partial y^j}{\partial x^i}. \tag{3.38} $$

In particular, we can write

$$ \alpha = b_j(y) \frac{\partial y^j}{\partial x^i}\, dx^i, \tag{3.39} $$

which shows that

$$ dy^j = \frac{\partial y^j}{\partial x^i}\, dx^i, \tag{3.40} $$

a reassuring result.
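As an illustration of (3.37) and (3.40), here is a short worked case. The choice of the plane with Cartesian coordinates $(x, y)$, polar coordinates $(r, \theta)$, and the particular 1-form $\alpha$ is an assumption made only for this example.

```latex
% Cartesian (x, y) versus polar (r, \theta), with x = r\cos\theta, y = r\sin\theta.
\begin{align*}
dx &= \frac{\partial x}{\partial r}\,dr + \frac{\partial x}{\partial \theta}\,d\theta
    = \cos\theta\,dr - r\sin\theta\,d\theta, \\
dy &= \frac{\partial y}{\partial r}\,dr + \frac{\partial y}{\partial \theta}\,d\theta
    = \sin\theta\,dr + r\cos\theta\,d\theta. \\
\intertext{For $\alpha = -y\,dx + x\,dy$, so that $a_x = -y$ and $a_y = x$, the rule (3.37) gives}
b_r &= a_x\frac{\partial x}{\partial r} + a_y\frac{\partial y}{\partial r}
     = -r\sin\theta\cos\theta + r\cos\theta\sin\theta = 0, \\
b_\theta &= a_x\frac{\partial x}{\partial \theta} + a_y\frac{\partial y}{\partial \theta}
     = r^2\sin^2\theta + r^2\cos^2\theta = r^2,
\end{align*}
% hence \alpha = r^2\,d\theta in polar coordinates.
```

Substituting the expressions for $dx$ and $dy$ directly into $\alpha$ gives the same answer, $\alpha = r^2\,d\theta$, as it must.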


EXERCISE 3.21 Prove that (3.37) is equivalent to (3.38) and that both follow from
(3.24). Hint: Consider $\langle X, \alpha \rangle$.

Just as the tangent space does not depend on any particular coordinate system
for its definition, so neither does the cotangent space. If $\{e_i\}$ is a generic basis for
$T_pM$ then the corresponding dual basis $\{\theta^i\}$ is called a coframe field. Thus

$$ \langle e_i, \theta^j \rangle = \delta_i^j. \tag{3.41} $$

In terms of the coframe field, an arbitrary 1-form can be written as

$$ \alpha = a_i \theta^i. \tag{3.42} $$

If $\{\theta'^i\}$ is another coframe field, related to the first by

$$ \theta'^i = (A^{-1})^i{}_j \theta^j, \tag{3.43} $$

then

$$ \alpha = a'_i \theta'^i, \tag{3.44} $$

where

$$ a'_i = a_j A^j{}_i. \tag{3.45} $$


3.9 The cotangent space as jet space*

In the previous section we defined a differential 1-form as a smoothly varying map
on $M$ that assigns to every point $p$ of $M$ an element of the cotangent space at $p$.
Although this is correct, it is perhaps rather unsatisfying because the cotangent
space is defined in terms of the tangent space, which is itself defined as a space of
derivations on functions. There is, however, an elegant way to define the cotangent
space directly, using just the notion of a smooth function, which is perhaps a bit
more natural from the point of view of manifold theory. It uses the idea of a “jet”.

Let $f : M \to \mathbb{R}$ be a smooth function, $p \in M$, and $\{x^i\}$ local coordinates around
$p$. We say that $f$ vanishes to first order at $p$ if $\partial f/\partial x^i$ vanishes at $p$ for all $i$. This
is a well-defined notion (that is, independent of the coordinate system) because if
$\{y^j\}$ is any other coordinate system about $p$ then

$$ \frac{\partial f}{\partial y^j}\bigg|_p = \frac{\partial f}{\partial x^i}\bigg|_p \frac{\partial x^i}{\partial y^j}\bigg|_p. $$

The Jacobian matrix is nonsingular and thus invertible, so $\partial f/\partial x^i$ vanishes for all
$i$ if and only if $\partial f/\partial y^j$ vanishes for all $j$. Inductively, for $k > 1$ we say that $f$
vanishes to $k$th order at $p$ if, for every $i$, $\partial f/\partial x^i$ vanishes to $(k-1)$th order at $p$,
where $f$ vanishes to zeroth order at $p$ if $f(p) = 0$. Put another way, $f$ vanishes to
$k$th order at $p$ if the first $k$ terms in its Taylor expansion vanish at $p$. For example,
$x$ vanishes to zeroth order at 0, $x^2$ vanishes to first order at 0, $x^3$ vanishes to second
order at 0, etc.

For $k > 0$ let $M_p^k$ denote the set of all smooth functions on $M$ vanishing to
$(k-1)$th order at $p$, and set $M_p^0 := \Omega^0(M)$ and $M_p := M_p^1$. Each $M_p^k$ is a vector
space under the usual pointwise operations, and we have the series of inclusions 23

$$ M_p^0 \supset M_p^1 \supset M_p^2 \supset \cdots. $$

We now define $T_p^*M$, the cotangent space to $M$ at $p$, to be the quotient space 24

$$ T_p^*M = M_p/M_p^2. \tag{3.46} $$

An element of $T_p^*M$ is called a differential 1-form at $p$.

The reason that this definition is so elegant is that it uses only minimal
ingredients, namely the notions of a smooth function on a manifold and its order of
vanishing at a point (which is a smooth, coordinate independent, notion). It offers
a way of understanding 1-forms intrinsically, without reference to derivations or
other additional structures. Moreover, all the usual properties of 1-forms can be
shown to follow easily.

23 This series is called a graded filtration. The word “filtration” suggests a kind of sieving
as we proceed down the chain, while the word “graded” means that the filtration carries a
natural product structure that respects the indexing. Specifically, $M_p^r \cdot M_p^s \subset M_p^{r+s}$.

24 It is a specific example of what is called a jet space.

For instance, let $f \in M_p^0$ be a smooth function on $M$. Then the differential of
$f$ at $p$, denoted $df_p$, is the element of $M_p/M_p^2$ given by the equivalence class of
$f - f(p) \cdot 1$, where 1 is the unit function. Symbolically,

$$ df_p = [f - f(p) \cdot 1]. \tag{3.47} $$

From this we obtain the next result. 


Theorem 3.4 Let $x^1, \dots, x^n$ be local coordinates in a neighborhood $U$ of a
point $p \in M$. Then $T_p^*M$ is spanned by the 1-forms $dx^1_p, \dots, dx^n_p$. In particular,
$\dim T_p^*M = n$.

Proof 25 Let $f$ be a smooth function on $M$. Choose local coordinates $x^1, \dots, x^n$
in a neighborhood of a point $p \in M$ and set $x = (x^1, \dots, x^n)$ as before. Without
loss of generality we can choose our coordinates so that $x(p) = 0$. Then, near $p$,
Taylor’s theorem with remainder gives

$$ (f - f(p) \cdot 1)(x) = \sum_i a_i x^i + \sum_{i,j} a_{ij}(x)\, x^i x^j, $$

where $a_i = \partial f/\partial x^i|_p$. But the quadratic term vanishes to first order at $p$, so

$$ [f - f(p) \cdot 1] = [a_i x^i] = [a_i (x^i - x^i(p) \cdot 1)], $$

which implies that

$$ df_p = a_i\, dx^i_p. $$

Hence the $dx^i_p$ span $T_p^*M$. To see that they are also linearly independent, suppose
that $\lambda_i\, dx^i_p = 0$. Then $\lambda_i x^i$ vanishes to first order at $p$ (i.e., $\lambda_i x^i \in M_p^2$), which is
only possible if all the $\lambda_i$ vanish. □


The proof shows that

$$ df_p = \frac{\partial f}{\partial x^i}\bigg|_p dx^i_p, \tag{3.48} $$

which agrees with the familiar differential from calculus. In particular, the Leibniz
rule

$$ d(fg)_p = g(p)\, df_p + f(p)\, dg_p $$

follows immediately from the ordinary rule. Moreover, if $y^1, \dots, y^n$ is another
coordinate system near $p$ then, as the definition of $df_p$ is coordinate independent,
we have

$$ \frac{\partial f}{\partial x^i}\, dx^i = \frac{\partial f}{\partial y^j}\, dy^j. $$

Applying this to $f = y^j$ gives

$$ dy^j = \frac{\partial y^j}{\partial x^i}\, dx^i, $$

namely the usual change of variables formula.

25 The reader will notice some similarities between this proof and that of Theorem 3.3
(see (3.17)). This is not accidental, of course, as both constructions essentially reduce to
Taylor’s theorem.


Remark One can now define the tangent space to be the dual of the cotangent 
space. Not surprisingly, the tangent space so defined coincides with the previous 
definition in terms of derivations. 


3.10 Tensor fields 

There are many different ways to define a tensor field on a manifold, but the essen- 
tial idea is simple enough: a tensor field is just a smooth assignment of a tensor 
to every point of M. The problem lies in the means by which one expresses this 
fact. As discussed in Section 2.4, we can view a tensor either as an element of a 
certain tensor product space or as a multilinear map. A similar duality holds for a 
tensor field. Both perspectives are used throughout the literature, and each is worth 
knowing. 26 


3.10.1 Tensor fields as elements of a tensor product space 

We begin by expressing a tensor field in terms of certain tensor product spaces.
Choose local coordinates $x^1, \dots, x^n$ on a patch $U$ of a manifold $M$. Then, for any
$p \in U$, the vector fields $\partial/\partial x^i$ and the 1-form fields $dx^i$ constitute dual bases for
$T_pM$ and $T_p^*M$, respectively. These combine to provide a basis

$$ \frac{\partial}{\partial x^{i_1}} \otimes \cdots \otimes \frac{\partial}{\partial x^{i_r}} \otimes dx^{j_1} \otimes \cdots \otimes dx^{j_s} \tag{3.49} $$

for the tensor product space $(T_pM)^{\otimes r} \otimes (T_p^*M)^{\otimes s}$, and any element $\Psi_p$ of this space
can be expanded in terms of this basis:

$$ \Psi_p = \Psi^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}\, \frac{\partial}{\partial x^{i_1}} \otimes \cdots \otimes \frac{\partial}{\partial x^{i_r}} \otimes dx^{j_1} \otimes \cdots \otimes dx^{j_s}. \tag{3.50} $$

A tensor field $\Psi$ of type $(r, s)$ is a map from $M$ to $T^r_s$ such that, on any coordinate
patch $U$, the components $\Psi^{i_1 i_2 \cdots i_r}{}_{j_1 j_2 \cdots j_s}$ are smoothly varying functions. 27

26 Actually, the fanciest way to view a tensor field is in terms of something called a tensor
product bundle. For this, see Example 7.5.
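If $\{y^1, \dots, y^n\}$ is a set of local coordinates on a patch $V$ that overlaps with $U$
then

$$ \frac{\partial}{\partial x^i} = \frac{\partial y^k}{\partial x^i} \frac{\partial}{\partial y^k} \quad\text{and}\quad dx^i = \frac{\partial x^i}{\partial y^k}\, dy^k. \tag{3.51} $$

So, writing $\Psi'^{\,i'_1 i'_2 \cdots i'_r}{}_{j'_1 j'_2 \cdots j'_s}$ for the components of $\Psi$ in the $y$ coordinates, we must
have

$$ \begin{aligned}
&\Psi'^{\,i'_1 \cdots i'_r}{}_{j'_1 \cdots j'_s}\, \frac{\partial}{\partial y^{i'_1}} \otimes \cdots \otimes \frac{\partial}{\partial y^{i'_r}} \otimes dy^{j'_1} \otimes \cdots \otimes dy^{j'_s} \\
&\quad = \Psi^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}\, \frac{\partial}{\partial x^{i_1}} \otimes \cdots \otimes \frac{\partial}{\partial x^{i_r}} \otimes dx^{j_1} \otimes \cdots \otimes dx^{j_s} \\
&\quad = \Psi^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}\, \frac{\partial y^{i'_1}}{\partial x^{i_1}} \frac{\partial}{\partial y^{i'_1}} \otimes \cdots \otimes \frac{\partial y^{i'_r}}{\partial x^{i_r}} \frac{\partial}{\partial y^{i'_r}} \otimes \frac{\partial x^{j_1}}{\partial y^{j'_1}}\, dy^{j'_1} \otimes \cdots \otimes \frac{\partial x^{j_s}}{\partial y^{j'_s}}\, dy^{j'_s}.
\end{aligned} \tag{3.52} $$

Comparing the first and last lines of (3.52) gives the transformation law for tensor
field components:

$$ \Psi'^{\,i'_1 \cdots i'_r}{}_{j'_1 \cdots j'_s} = \Psi^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}\, \frac{\partial y^{i'_1}}{\partial x^{i_1}} \cdots \frac{\partial y^{i'_r}}{\partial x^{i_r}} \frac{\partial x^{j_1}}{\partial y^{j'_1}} \cdots \frac{\partial x^{j_s}}{\partial y^{j'_s}}. \tag{3.53} $$

The components are each considered to be functions on their respective coordinate
systems. If we wished to express both sides as functions of the same coordinates,
say the $x$’s, we would have to write

$$ \Psi'^{\,i'_1 \cdots i'_r}{}_{j'_1 \cdots j'_s}(y(x)) = \Psi^{i_1 \cdots i_r}{}_{j_1 \cdots j_s}(x)\, \frac{\partial y^{i'_1}}{\partial x^{i_1}} \cdots \frac{\partial y^{i'_r}}{\partial x^{i_r}} \frac{\partial x^{j_1}}{\partial y^{j'_1}} \cdots \frac{\partial x^{j_s}}{\partial y^{j'_s}}, \tag{3.54} $$

where it is understood that the derivatives also are evaluated at $x$ or $y(x)$ as
appropriate.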
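To see (3.53) at work in the simplest nontrivial case, the following sketch transforms a type $(0, 2)$ tensor from Cartesian to polar coordinates on the plane. The tensor chosen here, with components $\delta_{ij}$ in Cartesian coordinates, and the helper name `transform_02` are assumptions for illustration only.

```python
import math

def transform_02(T, J):
    """Transform type-(0, 2) components using (3.53) with r = 0, s = 2.

    T[i][j] holds the components in the x coordinates and
    J[i][k] = dx^i/dy^k. The result is T'[k][l] = T[i][j] J[i][k] J[j][l].
    """
    n = len(T)
    return [[sum(T[i][j] * J[i][k] * J[j][l]
                 for i in range(n) for j in range(n))
             for l in range(n)]
            for k in range(n)]

r, theta = 2.0, 0.6
# x = r cos(theta), y = r sin(theta); the columns are d/dr and d/dtheta.
J = [[math.cos(theta), -r * math.sin(theta)],
     [math.sin(theta),  r * math.cos(theta)]]
delta = [[1.0, 0.0], [0.0, 1.0]]  # components of dx (x) dx + dy (x) dy

# Expected (up to rounding): [[1, 0], [0, r**2]]
print(transform_02(delta, J))
```

Each lower index picks up one factor of $\partial x^i/\partial y^k$; an upper index would instead pick up a factor of $\partial y^k/\partial x^i$, as in (3.53).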


EXERCISE 3.22 Not every object with indices is a tensor field. Let $X = X^i\, \partial/\partial x^i$
be a vector field. Show that the functions $\partial X^i/\partial x^j$ cannot be the components of any
tensor field, by showing that they do not obey (3.53).

Of course, the components of $\Psi$ are different in a different basis. In order to
distinguish the components of $\Psi$ in a coordinate basis from those in a more general
basis, it is traditional to use different letters for the dummy indices. Thus one writes
$\{e_a\}$ for a generic frame field and $\{\theta^a\}$ for the corresponding coframe field, in which
case the tensor field $\Psi$ can be written

$$ \Psi = \Psi^{a_1 \cdots a_r}{}_{b_1 \cdots b_s}\, e_{a_1} \otimes \cdots \otimes e_{a_r} \otimes \theta^{b_1} \otimes \cdots \otimes \theta^{b_s}. \tag{3.55} $$

Under a change of basis

$$ e'_a = e_b A^b{}_a \quad\text{and}\quad \theta'^a = (A^{-1})^a{}_b\, \theta^b, $$

we have

$$ \Psi'^{\,a'_1 \cdots a'_r}{}_{b'_1 \cdots b'_s} = \Psi^{a_1 \cdots a_r}{}_{b_1 \cdots b_s}\, (A^{-1})^{a'_1}{}_{a_1} \cdots (A^{-1})^{a'_r}{}_{a_r}\, A^{b_1}{}_{b'_1} \cdots A^{b_s}{}_{b'_s}. \tag{3.56} $$

The only difference between (3.56) and (2.12) is that the tensor components and
matrix elements are now smooth functions on $M$.

27 In this subsection and the next we use $\Psi$ rather than $T$ to denote a general tensor field,
so as not to confuse $\Psi_p$, the tensor field at a point $p$, with $T_p$, the tangent space at $p$.


3.10.2 Tensor fields as function linear maps

If you dislike indices, you can take some comfort in the fact that a tensor field can
also be viewed as a smoothly varying multilinear map. Let $T^r_s(p)$ be the space of
all multilinear maps on

$$ \underbrace{T_p^*M \times \cdots \times T_p^*M}_{r\ \text{times}} \times \underbrace{T_pM \times \cdots \times T_pM}_{s\ \text{times}}. $$

A tensor field $\Psi$ of type $(r, s)$ is then a smooth assignment of an element $\Psi_p \in
T^r_s(p)$ to each $p \in M$. “Smooth” in this context means that, for any smooth covector
fields $\alpha_1, \dots, \alpha_r$ and smooth vector fields $X_1, \dots, X_s$ in a neighborhood of $p$,
the map

$$ p \mapsto \Psi_p(\alpha_1(p), \dots, \alpha_r(p), X_1(p), \dots, X_s(p)) \tag{3.57} $$

is smooth.

The map (3.57) depends only on the values of the covector and vector fields at
$p$. In particular, it must give the same value for any other fields that agree with
the chosen ones at $p$. This, in turn, implies that the map $\Psi$ must be more than just
multilinear - in fact, it must be function linear, which means that (2.14) must hold
for $\Psi$ with the scalars $a$, $b$ replaced by smooth functions. 28 More precisely, we
have the following useful result.

Theorem 3.5 Let $\Gamma(TM)$ (respectively, $\Gamma(T^*M)$) denote the space of all vector
(respectively, covector) fields on $M$. 29 Then a tensor field $\Psi$ of type $(r, s)$ is a
function linear map

$$ \Psi : \underbrace{\Gamma(T^*M) \times \cdots \times \Gamma(T^*M)}_{r\ \text{times}} \times \underbrace{\Gamma(TM) \times \cdots \times \Gamma(TM)}_{s\ \text{times}} \to \Omega^0(M). $$

28 Technically, we should call such maps function multilinear (because they are function
linear in each entry).

29 For further discussion of the objects $\Gamma(TM)$ and $\Gamma(T^*M)$, see Section 7.1.

Proof For simplicity we will prove the theorem for the case $r = 0$. The general
case is similar. Suppose that $\Psi$ satisfies the conditions of the theorem. We claim
that the value of $\Psi$ at $p$ depends only on the values of $X_i(p)$, $1 \le i \le s$. Let $\{e_j\}$
be a frame field defined on a neighborhood $U$ of $p$. Expanding each $X_i$ in terms
of the basis $\{e_j\}$ gives $X_i = e_j X^j{}_i$ for some smooth functions $X^j{}_i$. By function
linearity

$$ \Psi(X_1, \dots, X_s) = X^{j_1}{}_1 \cdots X^{j_s}{}_s\, \Psi(e_{j_1}, \dots, e_{j_s}), $$

so we have

$$ \Psi(X_1, \dots, X_s)(p) = X^{j_1}{}_1(p) \cdots X^{j_s}{}_s(p)\, \Psi(e_{j_1}, \dots, e_{j_s})(p). $$

Let $\{X'_i\}$ be another set of vector fields on $U$ that satisfy $X'_i(p) = X_i(p)$, so that
$X'^j{}_i(p) = X^j{}_i(p)$ for $1 \le i \le s$ and $1 \le j \le n$. Then

$$ \begin{aligned}
\Psi(X'_1, \dots, X'_s)(p) &= X'^{j_1}{}_1(p) \cdots X'^{j_s}{}_s(p)\, \Psi(e_{j_1}, \dots, e_{j_s})(p) \\
&= X^{j_1}{}_1(p) \cdots X^{j_s}{}_s(p)\, \Psi(e_{j_1}, \dots, e_{j_s})(p) \\
&= \Psi(X_1, \dots, X_s)(p),
\end{aligned} $$

which proves the claim.

Now, given vectors $Y_1, \dots, Y_s \in T_pM$ define a map

$$ \Psi : M \to T^0_s $$

by

$$ \Psi_p(Y_1, \dots, Y_s) := \Psi(X_1, \dots, X_s)(p), $$

for any set of vector fields $\{X_i\}$ satisfying $X_i(p) = Y_i$. By construction the
right-hand side is a smoothly varying function of $p$, so $\Psi$ is a tensor field. Conversely,
given a tensor field we can construct a function linear map of the above type by
reversing this process. □

By duality, Theorem 3.5 allows us to view other function linear maps as tensors.
For example, suppose we have a function linear map $S : \Gamma(TM) \to \Gamma(TM)$.
If $X$ is a vector field then $S(X)$ is a vector field, not a function, so technically
Theorem 3.5 does not apply. But $S$ determines (and is determined by) a
unique tensor field $\tilde{S} : \Gamma(T^*M) \times \Gamma(TM) \to \Omega^0(M)$ given by $\tilde{S}(\alpha, X) =
\langle \alpha, S(X) \rangle$. In this sense, we are justified in calling $S$ a tensor field, or more simply,
tensorial.


3.11 Differential forms 

Just as a $k$-vector is a special kind of tensor, namely an alternating one, a $k$-form is a
special kind of tensor field, namely an alternating one. Viewed in this way they are
not particularly interesting. But $k$-forms are by far the most useful kind of tensor
fields, for several reasons. First, they are easy to define. Second, they are easy to use
because, for the most part, one does not have to deal with all those irritating indices.
Third, almost all important geometrical quantities can be expressed in terms of
forms. And last, but certainly not least, differential forms are essentially the things
that appear under integral signs.

Let $U$ be a coordinate patch of $M$ and let $p \in U$. A $k$-form (or a form of
degree $k$) $\omega_p$ on $U$ at $p$ is an element of $\bigwedge^k(T_p^*M)$. It follows that, in local
coordinates,

$$ \omega_p = \frac{1}{k!}\, a_{i_1 \cdots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k} = \sum_I a_I\, dx^I, \tag{3.58} $$

for some constants $a_I = a_{i_1 \cdots i_k}$. A (differential) $k$-form $\omega$ on $M$ is a smooth
assignment $p \mapsto \omega_p$. In local coordinates,

$$ \omega_U = \frac{1}{k!}\, a_{i_1 \cdots i_k}(x)\, dx^{i_1} \wedge \cdots \wedge dx^{i_k} = \sum_I a_I(x)\, dx^I, \tag{3.59} $$

where the $a_I(x)$ are now smooth functions on $U$. The vector space of all $k$-forms
on $M$ is denoted $\Omega^k(M)$. In particular, we think of the smooth functions on $M$ as
0-forms.

As with tensor fields, it is difficult to give explicit examples of differential forms
on a general manifold $M$, because forms by definition are globally defined but
coordinates are only local. An expression such as (3.59) only specifies the form $\omega$
restricted to a single coordinate patch $U$. One would have to show that $\omega_U$ extends
to a global form on $M$. We do this in Example 3.10 below. Of course, there is one
case where it is very easy to give explicit examples of differential forms, and that
is the case of Euclidean space.


Example 3.9 On $\mathbb{R}^n$ we have global coordinates $\{x^i\}$, so it is easy to define smooth
forms. For example,

$$ \omega = (6xy^2 + 3z)\, dx + (3y \sin z)\, dy + (e^x y)\, dz $$

is a globally defined 1-form on $\mathbb{R}^3$, while

p. = 3 x~y dy Adz + lz? dz A dx + 4 xy dx A dy 
is a globally defined 2-form on R 3 . 


Example 3.10 Consider again the 2-sphere with the atlas obtained from
stereographic projection, as in Example 3.6. Now we do not have global coordinates, so
we do not have obvious global forms. Instead, we must define everything locally and
then hope that we can patch things together consistently. Typically one does this by
first defining a form on a single patch and then extending it to the other patches. So,
for example, suppose that we define a 2-form on the patch $V$ by

$$ \omega_V = \frac{dv^1 \wedge dv^2}{(1 + \xi)^2}. $$

First we must check that this is indeed defined everywhere on $V$. For example, it
would do us no good to define a 2-form on $V$ by $(dv^1 \wedge dv^2)/\xi$, because this form
blows up at $(v^1, v^2) = (0, 0)$, which is a legitimate point of $V$. But $\omega_V$ does not
suffer from this indignity, so we’re ok. The next question is, can $\omega_V$ be smoothly
extended to the rest of $S^2$, i.e., is there a 2-form $\omega_U$ on $U$ that smoothly agrees with
$\omega_V$ on the overlap $U \cap V$? The obvious method turns out to work: we simply define
$\omega_U = \omega_V$ on the overlap (so that they trivially agree), reexpress $\omega_U$ in terms of the
$u$ coordinates, and define $\omega_U$ elsewhere on $U$ using this expression. The reason why
this works is that changing variables from $v$ to $u$ is a smooth process (although we
must be careful to verify that the resulting form is well defined everywhere on $U$).

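So, how do we change variables? The answer is that we just use (3.53). But that
equation can be confusing because it has too many indices, so instead we just follow
our instincts and use the chain rule together with the properties of the wedge product
(in Section 3.14 we will formalize this process):

$$ \begin{aligned}
dv^1 \wedge dv^2 &= \left( \frac{\partial v^1}{\partial u^1}\, du^1 + \frac{\partial v^1}{\partial u^2}\, du^2 \right) \wedge \left( \frac{\partial v^2}{\partial u^1}\, du^1 + \frac{\partial v^2}{\partial u^2}\, du^2 \right) \\
&= \left( \frac{\partial v^1}{\partial u^1} \frac{\partial v^2}{\partial u^2} - \frac{\partial v^1}{\partial u^2} \frac{\partial v^2}{\partial u^1} \right) du^1 \wedge du^2 \\
&= \frac{\partial(v^1, v^2)}{\partial(u^1, u^2)}\, du^1 \wedge du^2.
\end{aligned} \tag{3.60} $$

The appearance of the Jacobian determinant on the right-hand side of (3.60) is not
coincidental; the general rule appears in Section 3.14 below. In any event, using the
results of Example 3.6 and setting $\omega_U = \omega_V$ on $U \cap V$ gives

$$ \omega_U = \frac{dv^1 \wedge dv^2}{(1 + \xi)^2} = -\frac{1}{\eta^2} \frac{du^1 \wedge du^2}{(1 + 1/\eta)^2} = -\frac{du^1 \wedge du^2}{(1 + \eta)^2}. $$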

This is a 2-form defined everywhere on $U$ and by construction it agrees with $\omega_V$ on
the overlap. It follows that there is a globally defined 2-form $\omega$ on $S^2$ which equals
$\omega_U$ on $U$ and $\omega_V$ on $V$.

EXERCISE 3.23 In Exercise 3.16 we covered the 2-torus with four coordinate
patches, $U_i$, $i = 1, \dots, 4$. Construct two global 1-forms $dx$ and $dy$ on the torus
by extending $dx_{U_1} = dx_1$ and $dy_{U_1} = dy_1$ to the rest of the torus. (Because the
2-torus is two dimensional, these 1-forms constitute a basis for the cotangent space.)
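Returning to Example 3.10, the change of variables there can be checked numerically. The sketch below rests on an assumption about Example 3.6, which is not reproduced in this section: that the transition map between the two stereographic charts is $v^i = u^i/\eta$ with $\eta = (u^1)^2 + (u^2)^2$, so that $\xi = (v^1)^2 + (v^2)^2 = 1/\eta$. The function names and the finite-difference step are likewise illustrative.

```python
def jacobian_det(u1, u2, h=1e-6):
    """Central-difference estimate of d(v^1, v^2)/d(u^1, u^2).

    Assumes the transition map v = u / eta with eta = u1**2 + u2**2
    (an assumption about Example 3.6), evaluated away from u = 0.
    """
    def v(a, b):
        eta = a * a + b * b
        return a / eta, b / eta

    # dv_du1 = (dv^1/du^1, dv^2/du^1), dv_du2 = (dv^1/du^2, dv^2/du^2)
    dv_du1 = [(p - m) / (2 * h) for p, m in zip(v(u1 + h, u2), v(u1 - h, u2))]
    dv_du2 = [(p - m) / (2 * h) for p, m in zip(v(u1, u2 + h), v(u1, u2 - h))]
    return dv_du1[0] * dv_du2[1] - dv_du2[0] * dv_du1[1]

def omega_U_coefficient(u1, u2):
    """Coefficient of du^1 ^ du^2 obtained from omega_V via (3.60)."""
    eta = u1 * u1 + u2 * u2
    xi = 1.0 / eta  # xi = (v^1)^2 + (v^2)^2 under the assumed transition map
    return jacobian_det(u1, u2) / (1.0 + xi) ** 2

u1, u2 = 0.8, -1.1
eta = u1 * u1 + u2 * u2
# Under the stated assumption the two printed numbers agree to high accuracy.
print(omega_U_coefficient(u1, u2), -1.0 / (1.0 + eta) ** 2)
```

Under this assumption the Jacobian determinant is $-1/\eta^2$, and the resulting coefficient $-1/(1+\eta)^2$ has no singularity at $\eta = 0$, which is why the form extends over all of $U$.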

3.12 The exterior derivative 

We now introduce a natural differential operator on $k$-forms, denoted simply by
the letter $d$. It is a far reaching generalization of the ordinary gradient, curl, and
divergence operators of multivariable calculus.

Theorem 3.6 There exists a unique linear operator

$$ d : \Omega^k(M) \to \Omega^{k+1}(M) $$

called the exterior derivative, satisfying the following properties. For any forms $\lambda$
and $\mu$, and for any function $f$, the operator $d$ is

(1) linear: $d(\lambda + \mu) = d\lambda + d\mu$,

(2) a graded derivation: $d(\lambda \wedge \mu) = d\lambda \wedge \mu + (-1)^{\deg \lambda}\, \lambda \wedge d\mu$,

(3) nilpotent: $d^2\lambda = 0$, and

(4) natural: in local coordinates $\{x^i\}$ about a point $p$,

$$ df = \frac{\partial f}{\partial x^i}\, dx^i. $$

Two comments are in order. Property (4) makes it appear as though the definition
depends on the local coordinate system but this is an illusion, for if $\{y^j\}$ is another
local coordinate system at $p$ then, by (3.37),

$$ \frac{\partial f}{\partial x^i}\, dx^i = \frac{\partial f}{\partial y^j}\, dy^j. \tag{3.61} $$

Clearly, by property (4) the exterior derivative on $\mathbb{R}^n$ coincides with the usual
notion of the differential of a function. Observe also that property (4) applied to the
coordinate function $x^i$ gives $d(x^i) = dx^i$, which is good. When this is combined
with property (3) we get

$$ d(dx^i) = 0, \tag{3.62} $$

another useful fact. Before proving the theorem, let’s look at some examples.

Example 3.11 Let $\omega = A\, dx + B\, dy + C\, dz$ be a 1-form on $\mathbb{R}^3$. Then

$$ \begin{aligned}
d\omega &= dA \wedge dx + A \wedge d^2x + dB \wedge dy + B \wedge d^2y + dC \wedge dz + C \wedge d^2z && \text{(by (1) and (2))} \\
&= dA \wedge dx + dB \wedge dy + dC \wedge dz && \text{(by (3))} \\
&= \left( \frac{\partial A}{\partial x} dx + \frac{\partial A}{\partial y} dy + \frac{\partial A}{\partial z} dz \right) \wedge dx
 + \left( \frac{\partial B}{\partial x} dx + \frac{\partial B}{\partial y} dy + \frac{\partial B}{\partial z} dz \right) \wedge dy \\
&\qquad + \left( \frac{\partial C}{\partial x} dx + \frac{\partial C}{\partial y} dy + \frac{\partial C}{\partial z} dz \right) \wedge dz && \text{(by (4))} \\
&= \left( \frac{\partial B}{\partial x} - \frac{\partial A}{\partial y} \right) dx \wedge dy + \text{cyclic}.
\end{aligned} \tag{3.63} $$

Here “cyclic” means that the subsequent terms are obtained from the first by
cyclically permuting $x \to y \to z$ and $A \to B \to C$. Observe that the components of $d\omega$
are just the components of the curl of the vector field $(A, B, C)$. We shall return to
this example below.
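As a quick sanity check of the four properties, here is a small worked instance. The particular 1-form, with $A = -y$, $B = x$, $C = z$, is chosen purely for illustration and does not come from the text.

```latex
\begin{align*}
\omega   &= -y\,dx + x\,dy + z\,dz, \\
d\omega  &= d(-y)\wedge dx + d(x)\wedge dy + d(z)\wedge dz \\
         &= -dy\wedge dx + dx\wedge dy + dz\wedge dz \\
         &= 2\,dx\wedge dy, \\
d^2\omega &= d(2)\wedge dx\wedge dy = 0.
\end{align*}
% Cross-check against (3.63): \partial B/\partial x - \partial A/\partial y = 1 - (-1) = 2,
% and the two cyclic terms vanish, matching curl(-y, x, z) = (0, 0, 2).
```

The only nonzero coefficient sits on $dx \wedge dy$, in agreement with the fact that the curl of $(-y, x, z)$ points along the $z$ axis.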

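Example 3.12 Let $\omega = A\, dy \wedge dz + B\, dz \wedge dx + C\, dx \wedge dy$ be a 2-form on $\mathbb{R}^3$.
Then

$$ d\omega = \left( \frac{\partial A}{\partial x} + \frac{\partial B}{\partial y} + \frac{\partial C}{\partial z} \right) dx \wedge dy \wedge dz, \tag{3.64} $$

which is obviously related to the divergence of the vector field $(A, B, C)$.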

Example 3.13 We can use the exterior derivative to define forms (globally) on a
smooth manifold $M$. For example, if $f : M \to \mathbb{R}$ is a smooth function then, provided
that it does not blow up anywhere, $df$ is a (globally defined) 1-form on $M$.

EXERCISE 3.24 Let $f : \mathbb{R}^3 \to \mathbb{R}$ be a smooth function. By expanding terms, show
explicitly that $df \wedge df = 0$.

EXERCISE 3.25 Let

$$ \alpha = 3x^2y^3z\, dx + xy^3z^2\, dy + 2yz^2\, dz $$

be a 1-form on $\mathbb{R}^3$. Compute $d\alpha$ and $d^2\alpha$.

EXERCISE 3.26 Let

$$ \omega = 6x^2\, dx \wedge dy - 3xyz^2\, dy \wedge dz + x^2y^2\, dz \wedge dx $$

be a 2-form on $\mathbb{R}^3$. Compute $d\omega$ and $d^2\omega$.


Proof of Theorem 3.6 Everything can be done locally in a neighborhood of $p$,
because properties (1), (2), and (3) of the theorem are clearly coordinate
independent, while we have shown that (4) holds in any system of coordinates. First we
show uniqueness. Suppose there were another linear operator $D$ satisfying
properties (1) through (4). By linearity, it suffices to show that $D = d$ on a monomial of
the form $a\, dx^I$. But, by (2), (3), and (4),

$$ D(a\, dx^I) = \sum_k \frac{\partial a}{\partial x^k}\, dx^k \wedge dx^I = d(a\, dx^I); $$

hence $D = d$. To show existence, just define

$$ d(a\, dx^I) = \sum_k \frac{\partial a}{\partial x^k}\, dx^k \wedge dx^I, \tag{3.65} $$

and extend by linearity. Then property (4) follows by setting $I = \emptyset$. Again by
linearity it suffices to prove (2) and (3) on monomials. We will prove (2) and leave
(3) as an exercise. So, let $\lambda = a\, dx^I$ and $\mu = b\, dx^J$. Then

$$ \begin{aligned}
d(\lambda \wedge \mu) &= d(ab\, dx^I \wedge dx^J) \\
&= \sum_k \frac{\partial (ab)}{\partial x^k}\, dx^k \wedge dx^I \wedge dx^J \\
&= \left( \sum_k \frac{\partial a}{\partial x^k}\, dx^k \wedge dx^I \right) \wedge (b\, dx^J)
 + (-1)^{|I|} (a\, dx^I) \wedge \left( \sum_k \frac{\partial b}{\partial x^k}\, dx^k \wedge dx^J \right) \\
&= d\lambda \wedge \mu + (-1)^{\deg \lambda}\, \lambda \wedge d\mu.
\end{aligned} $$

□

EXERCISE 3.27 Prove that the definition (3.65) implies that $d$ is nilpotent
(property (3)).
Example 3.14 (Maxwell’s equations) Let $M = \mathbb{M}^4$ be Minkowski spacetime
equipped with (global) coordinates $\{t, x, y, z\}$, so that $\{dt, dx, dy, dz\}$ is a basis for
$T^*M$. The inner product on $M$ induces an inner product on $T^*M$ given by

$$ g(dt, dt) = -1, \qquad g(dt, dx^i) = 0, \qquad g(dx^i, dx^j) = \delta^{ij}, $$

where $i, j = 1, 2, 3$. We choose

$$ \sigma = dt \wedge dx \wedge dy \wedge dz, $$

so that the results of Example 2.4 apply mutatis mutandis. In particular, we have the
following Hodge duals:

$$ \begin{aligned}
\star dt &= dx \wedge dy \wedge dz, \\
\star dx &= dt \wedge dy \wedge dz, \\
\star dy &= dt \wedge dz \wedge dx, \\
\star dz &= dt \wedge dx \wedge dy
\end{aligned} $$

and also

$$ \begin{aligned}
\star(dt \wedge dx) &= dy \wedge dz, \\
\star(dt \wedge dy) &= dz \wedge dx, \\
\star(dt \wedge dz) &= dx \wedge dy, \\
\star(dx \wedge dy) &= -dt \wedge dz, \\
\star(dy \wedge dz) &= -dt \wedge dx, \\
\star(dz \wedge dx) &= -dt \wedge dy.
\end{aligned} $$

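We can use all this to reformulate Maxwell’s equations in a particularly elegant form.
If $(E_x, E_y, E_z)$ and $(B_x, B_y, B_z)$ are the components of the electric and magnetic
field vectors $\mathbf{E}$ and $\mathbf{B}$, respectively, we first define a 2-form, sometimes called the
electromagnetic 2-form, that encodes both fields simultaneously: 30

$$ \begin{aligned}
F = {} & -E_x\, dt \wedge dx - E_y\, dt \wedge dy - E_z\, dt \wedge dz \\
& + B_x\, dy \wedge dz + B_y\, dz \wedge dx + B_z\, dx \wedge dy.
\end{aligned} \tag{3.66} $$

Thus,

$$ \begin{aligned}
-\star F = {} & B_x\, dt \wedge dx + B_y\, dt \wedge dy + B_z\, dt \wedge dz \\
& + E_x\, dy \wedge dz + E_y\, dz \wedge dx + E_z\, dx \wedge dy.
\end{aligned} $$

Applying the exterior derivative operator and collecting terms gives

$$ \begin{aligned}
-d{\star F} = {} & \left( \frac{\partial B_x}{\partial y} - \frac{\partial B_y}{\partial x} + \frac{\partial E_z}{\partial t} \right) dt \wedge dx \wedge dy \\
& + \left( \frac{\partial B_y}{\partial z} - \frac{\partial B_z}{\partial y} + \frac{\partial E_x}{\partial t} \right) dt \wedge dy \wedge dz \\
& + \left( \frac{\partial B_z}{\partial x} - \frac{\partial B_x}{\partial z} + \frac{\partial E_y}{\partial t} \right) dt \wedge dz \wedge dx \\
& + \left( \frac{\partial E_x}{\partial x} + \frac{\partial E_y}{\partial y} + \frac{\partial E_z}{\partial z} \right) dx \wedge dy \wedge dz.
\end{aligned} \tag{3.67} $$

30 We will work in the CGS system of units and set $c = 1$. Incidentally, we are now in a
position to answer a question left open in Example 2.1. There, we wrote $F_{\mu\nu}$ for the
components of the field strength tensor, but wondered what the appropriate basis was. We
now see that the tensor itself is

$$ F = F_{\mu\nu}\, dx^\mu \otimes dx^\nu = \tfrac{1}{2} F_{\mu\nu}\, dx^\mu \wedge dx^\nu. $$

In other words, the basis elements are just the $dx^\mu$.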


Next, we define a 1-form that encodes the charge and current densities,

$$ J = -\rho\, dt + J_x\, dx + J_y\, dy + J_z\, dz, $$

so that

$$ \star J = -\rho\, dx \wedge dy \wedge dz + J_x\, dt \wedge dy \wedge dz + J_y\, dt \wedge dz \wedge dx + J_z\, dt \wedge dx \wedge dy. \tag{3.68} $$

Comparing (3.67) and (3.68) we see that Maxwell’s equations with sources, namely

$$ \nabla \cdot \mathbf{E} = 4\pi\rho, \qquad \nabla \times \mathbf{B} = 4\pi \mathbf{J} + \frac{\partial \mathbf{E}}{\partial t}, $$

can be written compactly as

$$ d{\star F} = 4\pi \star J. \tag{3.69} $$

EXERCISE 3.28 Show that Maxwell’s equations without sources, namely

$$ \nabla \cdot \mathbf{B} = 0, \qquad \nabla \times \mathbf{E} = -\frac{\partial \mathbf{B}}{\partial t}, $$

can be written as

$$ dF = 0. \tag{3.70} $$

Remark Note the pleasing symmetry of Maxwell’s equations in the absence of
sources: $dF = d{\star F} = 0$.


Remark The utility of writing Maxwell’s equations in the forms (3.69) and (3.70) 
is not so much to facilitate computations of electromagnetic fields (although this 
can happen) but, rather, to gain an advantage in theoretical understanding. The next 
exercise provides one illustration of this point. Later we will see others. 

EXERCISE 3.29 Show that Maxwell’s equations imply the law of charge conservation,

$$ \nabla \cdot \mathbf{J} + \frac{\partial \rho}{\partial t} = 0, \tag{3.71} $$

where $\mathbf{J} = (J_x, J_y, J_z)$ is the ordinary current density vector field. Hint: $d^2 = 0$.


3.13 The interior product 

Given a vector field $X$ we can define a linear map $i_X : \Omega^k(M) \to \Omega^{k-1}(M)$ taking
$k$-forms to $(k-1)$-forms, called the interior product, satisfying the following
properties.31 Let $f$ be a function, $\omega$ a one-form, and $\lambda$ and $\eta$ arbitrary-degree forms.
Then we have:

(1) $i_X f = 0$;

(2) $i_X\omega = \omega(X) := \langle\omega, X\rangle$;

(3) $i_X(\lambda\wedge\eta) = i_X\lambda\wedge\eta + (-1)^{\deg\lambda}\,\lambda\wedge i_X\eta$.

(Property (3) says that the interior product is a graded derivation, just like the
exterior derivative.) If $\omega$ is a $k$-form and $X_1, \dots, X_k$ are vector fields, we define32

$$\omega(X_1, X_2, \dots, X_k) := (i_{X_1}\omega)(X_2, \dots, X_k) \tag{3.72}$$

or, equivalently,

$$\omega(X_1, X_2, \dots, X_k) = i_{X_k} i_{X_{k-1}} \cdots i_{X_1}\omega. \tag{3.73}$$

31 In some books the interior product $i_X\omega$ is called the "hook product" and is written $X \,\lrcorner\, \omega$.

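Example 3.15 If $\alpha$ and $\beta$ are 1-forms and $X$ and $Y$ are vector fields,

$$\begin{aligned}
(\alpha\wedge\beta)(X, Y) &= i_Y i_X(\alpha\wedge\beta) \\
&= i_Y(i_X\alpha\wedge\beta - \alpha\wedge i_X\beta) \\
&= i_Y(\alpha(X)\beta - \beta(X)\alpha) \\
&= \alpha(X)\beta(Y) - \beta(X)\alpha(Y).
\end{aligned} \tag{3.74}$$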
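The evaluation rule (3.73) and the result (3.74) can be checked mechanically at a single point. The following is a minimal sketch, not taken from the text: it assumes constant-coefficient forms on $\mathbb{R}^n$ stored as dictionaries keyed by increasing index tuples, and the helper names (`sort_sign`, `wedge`, `interior`, `evaluate`) are hypothetical.

```python
def sort_sign(indices):
    """Sort an index tuple; return (sign of the permutation, sorted tuple).

    The sign is 0 if an index repeats, since dx^i ^ dx^i = 0.
    """
    idx = list(indices)
    if len(set(idx)) < len(idx):
        return 0, ()
    sign = 1
    for i in range(len(idx)):                 # bubble sort, one sign flip per swap
        for j in range(len(idx) - 1 - i):
            if idx[j] > idx[j + 1]:
                idx[j], idx[j + 1] = idx[j + 1], idx[j]
                sign = -sign
    return sign, tuple(idx)


def wedge(a, b):
    """Wedge product of two forms given as {sorted index tuple: coefficient}."""
    out = {}
    for I, x in a.items():
        for J, y in b.items():
            s, K = sort_sign(I + J)
            if s:
                out[K] = out.get(K, 0) + s * x * y
    return {K: c for K, c in out.items() if c != 0}


def interior(X, w):
    """Interior product i_X w, using the graded derivation property (3)."""
    out = {}
    for I, c in w.items():
        for pos, i in enumerate(I):
            K = I[:pos] + I[pos + 1:]         # drop dx^i from the basis element
            out[K] = out.get(K, 0) + (-1) ** pos * X[i] * c
    return {K: c for K, c in out.items() if c != 0}


def evaluate(w, *vectors):
    """w(X_1, ..., X_k) = i_{X_k} ... i_{X_1} w, as in (3.73)."""
    for X in vectors:
        w = interior(X, w)
    return w.get((), 0)


# Illustrative data (assumed, not from the text).
alpha = {(0,): 1, (1,): 2, (2,): 3}
beta = {(0,): 4, (1,): 5, (2,): 6}
X = [1, 0, 2]
Y = [3, 1, -1]

lhs = evaluate(wedge(alpha, beta), X, Y)
rhs = evaluate(alpha, X) * evaluate(beta, Y) - evaluate(beta, X) * evaluate(alpha, Y)
print(lhs, rhs)
```

Both printed numbers should agree, which is exactly the content of (3.74); swapping `X` and `Y` flips the sign.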

EXERCISE 3.30

(a) Show that

$$i_X\,df = df(X) = Xf. \tag{3.75}$$

(b) Verify that the interior product is function linear on its argument. That is, show
that

$$i_{fX} = f\,i_X, \tag{3.76}$$

for any function $f$ and any vector field $X$. Hint: Use induction on the degree.

(c) Show that the interior product map on forms is antisymmetric:

$$i_{X_i} i_{X_j} = -i_{X_j} i_{X_i}. \tag{3.77}$$

Equivalently,

$$\omega(X_1, \dots, X_i, \dots, X_j, \dots, X_k) = -\omega(X_1, \dots, X_j, \dots, X_i, \dots, X_k). \tag{3.78}$$

Hint: Use another induction.

(d) Let $\omega = \lambda\wedge\mu\wedge\nu$, where $\lambda$, $\mu$, and $\nu$ are 1-forms. Evaluate $\omega(X_1, X_2, X_3)$ as a
sum of products of the form $\lambda(X_i)\mu(X_j)\nu(X_k)$.

(e) We asserted in footnote 32 that (3.72) (or (3.73)) implied a particular choice of
prefactor $c_p$ in (2.40). Generalizing from part (d), what is $c_p$?


32 Although it does not look like it, this definition is equivalent to a particular choice of prefactor in (2.40). See
Exercise 3.30(e).

EXERCISE 3.31 Suppose that $X_1, \dots, X_n$ are $n$ linearly dependent tangent vectors
at a point $p$ of a manifold $M$. Show that $\omega_p(X_1, \dots, X_n) = 0$ for every $n$-form $\omega$.
Hint: There is a hard way and an easy way to do this problem. The hard way is to
assume that $\omega = dx^1\wedge\cdots\wedge dx^n$ and then show that $(dx^1\wedge\cdots\wedge dx^n)(X_1, \dots, X_n) = \det X_i{}^j$,
where $X_i = X_i{}^j\,\partial_j$. The easy way is to use the function linearity and
antisymmetry of the interior product together with a bit of linear algebra.


3.14 Pullback 

Let $f : M \to N$ be a smooth map of manifolds. Given a smooth map $g : N \to \mathbb{R}$
we define a new map $f^*g : M \to \mathbb{R}$ by $f^*g = g\circ f$. The map $f^*g$ is called the
pullback of $g$ by $f$, because the function $g$ is "pulled back" from $N$ to $M$. (See
Figure 3.12.) We extend the pullback map to forms by requiring it to be

(1) a ring homomorphism, so that

(i) $f^*(\lambda + \mu) = f^*\lambda + f^*\mu$,

(ii) $f^*(\lambda\wedge\eta) = f^*\lambda\wedge f^*\eta$,
and

(2) natural, that is, $d(f^*\lambda) = f^*(d\lambda)$.

Let $\omega$ be a 1-form on $N$. Let $U \subset M$ and $V \subset N$ be open sets with $f(U) \subset V$.
Let $y^1, \dots, y^n$ be local coordinates on $V$ and $x^1, \dots, x^m$ local coordinates on
$U$, and suppose that $\omega = \sum_i a_i\,dy^i$ on $V$. Then applying properties (1) and
(2) gives

$$f^*\omega = \sum_i (f^*a_i)\,f^*(dy^i) = \sum_i (a_i\circ f)\,df^i. \tag{3.79}$$

Figure 3.12 The pullback map.
So far, so good. But now we employ the same convention as before (cf. (3.14)) and
identify the coordinate functions $f^i$ with $y^i$, so that (3.79) becomes

$$(f^*\omega)(x) = \sum_{i,j} a_i(y(x))\,\frac{\partial y^i}{\partial x^j}\,dx^j. \tag{3.80}$$

It follows from (3.80) that property (2) is an extension of the usual chain rule from
calculus, for if $g : V \to \mathbb{R}$ then (3.80) implies that

$$f^*(dg) = f^*\left(\sum_i \frac{\partial g}{\partial y^i}\,dy^i\right) = \sum_{i,j} \frac{\partial g}{\partial y^i}\,\frac{\partial y^i}{\partial x^j}\,dx^j$$

while

$$d(f^*g) = \sum_j \frac{\partial (f^*g)}{\partial x^j}\,dx^j = \sum_j \frac{\partial (g\circ f)}{\partial x^j}\,dx^j.$$

We see that the statement $f^*(dg) = d(f^*g)$ is just the ordinary chain rule, which
is why property (2) is called "natural".


Example 3.16 Let $\alpha = 6xy\,dx + 2yz\,dy + x^2\,dz$ be a 1-form in $\mathbb{R}^3$, and let
$\gamma : I \to \mathbb{R}^3$ be a parameterized curve given by $t \mapsto (t, t^2, t^3)$. Then

$$\begin{aligned}
(\gamma^*\alpha)(t) &= 6(t)(t^2)(dt) + 2(t^2)(t^3)(2t\,dt) + (t^2)(3t^2\,dt) \\
&= (6t^3 + 3t^4 + 4t^6)\,dt.
\end{aligned}$$

Example 3.17 Let $\omega = y\,dy\wedge dz + 2y\,dz\wedge dx + xz\,dx\wedge dy$ be a 2-form in $\mathbb{R}^3$ and let
$\sigma : I\times I \to \mathbb{R}^3$ be a parameterized surface given by $(u, v) \mapsto (u\cos v, u\sin v, 2u^2)$.
Then

$$\begin{aligned}
(\sigma^*\omega)(u, v) &= (u\sin v)(\sin v\,du + u\cos v\,dv)\wedge(4u\,du) \\
&\quad + 2(u\sin v)(4u\,du)\wedge(\cos v\,du - u\sin v\,dv) \\
&\quad + (u\cos v)(2u^2)(\cos v\,du - u\sin v\,dv)\wedge(\sin v\,du + u\cos v\,dv) \\
&= (-4u^3\sin v\cos v - 8u^3\sin^2 v + 2u^4\cos v)\,du\wedge dv.
\end{aligned}$$
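The result of Example 3.17 can be cross-checked numerically. The sketch below is an illustration under stated assumptions rather than the author's method: it approximates the partial derivatives of $\sigma$ by central differences (step size `h` is an arbitrary choice) and uses the fact that a pulled-back basis 2-form such as $dy\wedge dz$ becomes the corresponding $2\times 2$ Jacobian minor times $du\wedge dv$. The function names are hypothetical.

```python
import math


def sigma(u, v):
    """The parameterized surface of Example 3.17."""
    return (u * math.cos(v), u * math.sin(v), 2 * u * u)


def omega_coeffs(x, y, z):
    """Coefficients of dy^dz, dz^dx, dx^dy in omega."""
    return (y, 2 * y, x * z)


def partials(f, u, v, h=1e-6):
    """Central-difference approximations to df/du and df/dv (componentwise)."""
    du = [(p - m) / (2 * h) for p, m in zip(f(u + h, v), f(u - h, v))]
    dv = [(p - m) / (2 * h) for p, m in zip(f(u, v + h), f(u, v - h))]
    return du, dv


def pullback_coeff(u, v):
    """Coefficient of du^dv in sigma^* omega, via 2x2 Jacobian minors."""
    x, y, z = sigma(u, v)
    P, Q, R = omega_coeffs(x, y, z)
    (xu, yu, zu), (xv, yv, zv) = partials(sigma, u, v)
    return (P * (yu * zv - yv * zu)      # dy ^ dz
            + Q * (zu * xv - zv * xu)    # dz ^ dx
            + R * (xu * yv - xv * yu))   # dx ^ dy


def closed_form(u, v):
    """The coefficient obtained by hand in Example 3.17."""
    return (-4 * u**3 * math.sin(v) * math.cos(v)
            - 8 * u**3 * math.sin(v) ** 2
            + 2 * u**4 * math.cos(v))


print(pullback_coeff(1.3, 0.7), closed_form(1.3, 0.7))
```

The two printed values should agree up to the finite-difference error.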


In general, suppose that $\omega$ is an $n$-form on an $n$-dimensional manifold $M$ and is
represented in local coordinates on $V$ by

$$\omega = a(y)\,dy^1\wedge\cdots\wedge dy^n. \tag{3.81}$$

Pulling this back to $U$ by means of $f^*$ gives

$$\begin{aligned}
f^*\omega &= (f^*a)\,f^*(dy^1)\wedge\cdots\wedge f^*(dy^n) \\
&= (a\circ f)\,d(f^*y^1)\wedge\cdots\wedge d(f^*y^n) \\
&= (a\circ f)\,df^1\wedge\cdots\wedge df^n \\
&= (a\circ y)\,dy^1\wedge\cdots\wedge dy^n \\
&= (a\circ y)\,\frac{\partial y^1}{\partial x^{i_1}}\cdots\frac{\partial y^n}{\partial x^{i_n}}\,dx^{i_1}\wedge\cdots\wedge dx^{i_n} \\
&= (a\circ y)\left[\frac{\partial y^i}{\partial x^j}\right] dx^1\wedge\cdots\wedge dx^n,
\end{aligned} \tag{3.82}$$

where $[\partial y^i/\partial x^j]$ denotes the usual Jacobian determinant.33 (For the last step, see
(2.53); compare with (3.60).) It is no accident that this looks like the change of variables
theorem in calculus, as the pullback operation together with the antisymmetry
of the wedge product is designed to reproduce that result.


EXERCISE 3.32 Let

$$\omega = 3xz\,dx - 7y^2z\,dy + 2x^2y\,dz$$

be a 1-form on $\mathbb{R}^3$, and suppose that $\varphi : \mathbb{R} \to \mathbb{R}^3$ is the map $\varphi(t) = (t^2, 2t - 1, t)$.
Compute the 1-form $\varphi^*\omega$.

EXERCISE 3.33 Let

$$\theta = -x\,dy\wedge dz - y\,dz\wedge dx + 2z\,dx\wedge dy$$

be a 2-form on $\mathbb{R}^3$, and suppose that $\varphi : \mathbb{R}^2 \to \mathbb{R}^3$ is the map $\varphi(u, v) = (u^3\cos v, u^3\sin v, u^2)$.
Compute the 2-form $\varphi^*\theta$.

EXERCISE 3.34 Let $\omega = 4y\,dx + 2xy\,dy + dz$ and $\varphi : \mathbb{R}^2 \to \mathbb{R}^3$ be given by
$\varphi(u, v) = (u, v, u^2 + v^2)$. Show that $d\varphi^*\omega = \varphi^*d\omega$.


3.15 Pushforward 

Let $f : M \to N$ be any smooth map. We have seen how the pullback map $f^*$
naturally pulls a form on $N$ back to a form on $M$. One can ask a similar question
regarding vector fields. Do they pull back as well? The answer to this question
is complicated, and we shall return to it at the end of this section. Instead, we
introduce a map that is very familiar already, although the language may not be so.
If $g : N \to \mathbb{R}$ is a smooth function on $N$, we define a linear map $f_* : T_pM \to T_{f(p)}N$
of tangent spaces, called the pushforward, by

$$(f_*X_p)_{f(p)}(g) := X_p(f^*g). \tag{3.83}$$

33 We use square brackets here and occasionally elsewhere rather than the standard vertical brackets in order to
emphasize that we are referring to the determinant itself rather than its absolute value.

A hint as to the identity of the pushforward map is provided by its other name, the
derivative or differential of $f$.34 The next exercise provides the connection.

EXERCISE 3.35 If $\{x^i\}$ are local coordinates in the neighborhood of $p \in M$ and
$\{y^j\}$ are local coordinates in a neighborhood of $f(p) \in N$, show that $(f_*)^i{}_j = (\partial y^i/\partial x^j)_p$
relative to the local bases $\partial/\partial x^i$ and $\partial/\partial y^j$. In other words, in a local
basis the derivative map is represented by the Jacobian matrix.

Remark In terms of the pushforward map, the inverse function theorem can
be stated as follows: the map $f : M \to N$ is a local diffeomorphism in the
neighborhood of a point $p$ if and only if $f_*$ is an isomorphism of $T_pM$ and $T_{f(p)}N$.

EXERCISE 3.36 Show that, if $f : M \to M$ is the identity map then $f^*$ and $f_*$
are both identity maps. Show also that if $f : U \to V$ and $g : V \to W$ then
$(g\circ f)^* = f^*\circ g^*$ and $(g\circ f)_* = g_*\circ f_*$. In fancy mathematics lingo, one says
that the pullback is a contravariant functor while the pushforward is a covariant
functor.

Now, let $f : M \to N$ be smooth and suppose that $X$ is a vector field on $M$.
Furthermore, suppose that $f$ is not injective, so that $f(p_1) = f(p_2) = q$ for two
distinct points $p_1$ and $p_2$. Then $f_*$ maps the two tangent spaces $T_{p_1}M$ and $T_{p_2}M$
to the same image, namely $T_qN$. But there is nothing to guarantee that $X_{p_1}$ and $X_{p_2}$
will map to the same image, and if they do not, the image of $X$ under $f_*$ will not be
a well-defined vector field on $N$. This sort of pathology cannot occur if $f$ is a
diffeomorphism, because in that case $f$ is bijective. Therefore, when $f$ is a
diffeomorphism it makes sense to define the pushforward of the vector field $X$ in
the obvious way, namely

$$(f_*X)_{f(p)} := f_*X_p. \tag{3.84}$$

Equivalently, if $g : N \to \mathbb{R}$ is any smooth function then

$$(f_*X)_{f(p)}(g) = X_p(f^*g). \tag{3.85}$$

This looks so much like (3.83) that people often get confused. Just remember that
you can push vectors forward by any smooth map but you can only push vector
fields forward by a diffeomorphism.

Remark There is another way to write (3.85) that is often more useful in
computations. Viewing both sides of (3.85) as an equality of functions on $M$,
we have

$$f^*[(f_*X)(g)] = X(f^*g). \tag{3.86}$$

(Evaluate both sides at $p \in M$ to recover (3.85).)

34 For this reason one often sees $f_*$ written as $Df$.
Theorem 3.7 The Lie bracket of vector fields is natural with respect to pushforward
by a diffeomorphism. That is, for any diffeomorphism $\varphi : M \to N$ and vector
fields $X$ and $Y$ on $M$,

$$\varphi_*[X, Y] = [\varphi_*X, \varphi_*Y]. \tag{3.87}$$

Proof It suffices to show that both sides of (3.87) yield the same result when
acting on any smooth function $g : N \to \mathbb{R}$. Applying (3.86) twice gives

$$YX(\varphi^*g) = Y[\varphi^*[(\varphi_*X)(g)]] = \varphi^*[(\varphi_*Y)(\varphi_*X)(g)]. \tag{3.88}$$

Yet another application of (3.86) (with $X$ replaced by $[X, Y]$) gives

$$\begin{aligned}
\varphi^*[\varphi_*[X, Y](g)] &= [X, Y](\varphi^*g) \\
&= (XY - YX)(\varphi^*g) \\
&= XY(\varphi^*g) - YX(\varphi^*g) \\
&= \varphi^*[(\varphi_*X)(\varphi_*Y)(g) - (\varphi_*Y)(\varphi_*X)(g)],
\end{aligned}$$

where we used (3.88) (and a similar equation with $X$ and $Y$ interchanged) in the
last step. But $\varphi$ is a bijection, so we may strip $\varphi^*$ from both sides to get (3.87). □

In the special case where $f : M \to N$ is a diffeomorphism, the pushforward
operation can be extended to general tensor fields by recalling the definition of
a tensor as a multilinear operator. Let $p \in M$ and $q := f(p) \in N$. Let $X_i \in T_qN$
and $\omega^i \in T_q^*N$, and let $S$ be a tensor field of type $(r, s)$ in $M$. As $f$ is a
diffeomorphism, the inverse function theorem guarantees that $f_*$ is an isomorphism
of tangent spaces. In particular, $f_*$ is invertible. Thus we can write

$$(f_*S)(\omega^1, \dots, \omega^r, X_1, \dots, X_s)\big|_q := S(f^*\omega^1, \dots, f^*\omega^r, f_*^{-1}X_1, \dots, f_*^{-1}X_s)\big|_p. \tag{3.89}$$

Alternatively, if $R$ is a tensor field of type $(r, s)$ in $N$ then we can pull it back to
$M$ by

$$(f^*R)(\mu^1, \dots, \mu^r, Y_1, \dots, Y_s)\big|_p := R((f^{-1})^*\mu^1, \dots, (f^{-1})^*\mu^r, f_*Y_1, \dots, f_*Y_s)\big|_q, \tag{3.90}$$

where $Y_i \in T_pM$ and $\mu^i \in T_p^*M$. In this sense, pushforward and pullback are
"inverse" operations.

At last we can answer the question posed at the beginning of this section: do
vector fields pull back? The answer is that they naturally push forward, but by
virtue of the above correspondence we can define the pullback of a vector field $X$
by a diffeomorphism $\varphi$ to be the pushforward via the inverse map, so that

$$\varphi^*X := (\varphi^{-1})_*X, \tag{3.91}$$

and this is what we shall do.

3.16 Integral curves and the Lie derivative 

In addition to the exterior derivative of a differential form there is another intrinsic 
derivative on manifolds, where by “intrinsic” we mean that its definition does not 
require the introduction of any additional structure. If X is a vector field on M then, 
as we have seen, X can be thought of as a generalized directional derivative oper- 
ator on functions. But we want to be able to take derivatives of tensor fields, and 
X does not appear to act directly on tensor fields. It turns out that the pushforward 
map provides the tool we need. 

Given a vector field $X$ on $M$, the fundamental theorem of ordinary differential
equations guarantees that there is a unique maximal curve $\gamma(t) : I \to M$ through
each point $p \in M$ such that $\gamma(0) = p$ and such that the tangent vector to the
curve $\gamma$ at $\gamma(t)$ is precisely $X_{\gamma(t)}$. The curve $\gamma$ is called the integral curve of $X$
through $p$.35

To see how this works, it is worth taking a moment to discuss notation. In ordinary
calculus we call $\gamma'(t)$ the tangent vector to the curve. The reason is that the
curve $\gamma$ is a map into some Euclidean space, so $\gamma'(t)$ is naturally a vector. But, as
we have previously discussed, this interpretation is no longer tenable in a general
manifold that is not itself a vector space. Instead, vectors are derivations, so we
really want to understand $\gamma'(t)$ as a derivation. How do we do that?

The answer is to be a bit more precise. We view $I$ as a manifold, namely an
open interval of the real line, and attach the coordinate $t$ to the points of $I$. Then
$d/dt$ is a tangent vector field on $I$ which acts on functions $f : I \to \mathbb{R}$ according
to $(d/dt)f = df/dt$. Now we define $\gamma'(t) := \gamma_*(d/dt)_{\gamma(t)}$. Although this may
look intimidating, it's really not. In fact, it allows us to reduce the question of the
existence of an integral curve in a general manifold $M$ to a system of ordinary
differential equations.

To see why this is so, let $x^1, \dots, x^n$ be local coordinates around $p$. Applying
both sides of

$$\gamma'(t) = X_{\gamma(t)} \tag{3.92}$$

to the coordinate functions $x^i$ gives

$$\gamma_*(d/dt)(x^i) = \frac{d}{dt}\gamma^*(x^i) = \frac{d}{dt}(x^i\circ\gamma) = \frac{d}{dt}\gamma^i(t) = \frac{d}{dt}x^i(t), \tag{3.93}$$

and, taking $X = X^j\,\partial/\partial x^j$, we have

$$X_{\gamma(t)}(x^i) = X^j(\gamma(t))\,\frac{\partial x^i}{\partial x^j} = X^i(\gamma(t)). \tag{3.94}$$

Equating (3.93) and (3.94) gives the system of differential equations

$$\frac{dx^i}{dt} = X^i(\gamma(t)), \qquad i = 1, \dots, n, \tag{3.95}$$

whose simultaneous solution yields the curve $\gamma$. We must "integrate" these
equations to get the curve; hence the name "integral curve".

35 The range of $I$ is not specified, but "maximal" means that it is large enough that the curve exists in a neighborhood
of $p$ and no other integral curve covers more points of $M$. The question of how far such a curve can
be extended on the manifold $M$ is a delicate matter and the subject of much investigation. Indeed, some very
beautiful results of differential geometry give conditions for the existence of global integral curves.

Figure 3.13 Flowing along the integral curves of a vector field.

The existence of integral curves enables us to define the notion of a flow. At each
point $p \in M$ we can imagine moving a parameter distance $t$ along the integral
curve $\gamma$ of $X$ through $p$. If we do this for all points of $M$ simultaneously we get
what is called a flow along $X$, where you can think of this movement as the flow of
a fluid. (See Figure 3.13.) More precisely, for any open set $U$ of $M$ and any $p \in U$,
define $\gamma_p(t)$ to be the integral curve of $X$ through $p$, with $p = \gamma_p(0)$. Then the
map $\phi_t : U \to M$ given by $\gamma_p(0) \mapsto \gamma_p(t)$ is a flow, or a one-parameter group
of diffeomorphisms of $M$ generated by the vector field $X$.36
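To make the passage from a vector field to its flow concrete, here is a minimal numerical sketch. It is an illustration only: the rotation field $X = -y\,\partial_x + x\,\partial_y$ on $\mathbb{R}^2$, the classical fourth-order Runge-Kutta integrator, and the names `rk4_step` and `flow` are all assumptions chosen for this example, not taken from the text. For this field the system (3.95) reads $\dot{x} = -y$, $\dot{y} = x$, whose integral curve through $(1, 0)$ is $(\cos t, \sin t)$.

```python
import math


def X(p):
    """Components of the rotation field X = -y d/dx + x d/dy at the point p."""
    x, y = p
    return (-y, x)


def rk4_step(field, p, h):
    """One classical Runge-Kutta step for dx^i/dt = X^i(x)."""
    k1 = field(p)
    k2 = field(tuple(pi + 0.5 * h * ki for pi, ki in zip(p, k1)))
    k3 = field(tuple(pi + 0.5 * h * ki for pi, ki in zip(p, k2)))
    k4 = field(tuple(pi + h * ki for pi, ki in zip(p, k3)))
    return tuple(pi + h / 6.0 * (a + 2 * b + 2 * c + d)
                 for pi, a, b, c, d in zip(p, k1, k2, k3, k4))


def flow(field, p, t, steps=1000):
    """Approximate phi_t(p): follow the integral curve through p for time t."""
    h = t / steps
    for _ in range(steps):
        p = rk4_step(field, p, h)
    return p


p0 = (1.0, 0.0)
print(flow(X, p0, 1.0), (math.cos(1.0), math.sin(1.0)))

# Group property: flowing for time s and then time t matches flowing for s + t.
print(flow(X, flow(X, p0, 0.4), 0.6), flow(X, p0, 1.0))
```

Up to integration error, the second pair of outputs illustrates the composition law $\phi_{s+t} = \phi_t\circ\phi_s$, and a negative `t` runs the flow backwards.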

Using this flow, we can generalize the notion of the directional derivative to
tensor fields. First observe that, in terms of the flow $\phi_t$ (and writing $f_p$ for $f(p)$
and $\phi_t p$ for $\gamma_p(t)$),

$$(Xf)_p = \lim_{t\to 0}\frac{f_{\phi_t p} - f_p}{t}. \tag{3.96}$$

If we tried to do something similar for tensors, it might look like this:

$$(\mathcal{L}_X T)_p = \lim_{t\to 0}\frac{T_{\phi_t p} - T_p}{t} \quad (???),$$

where $\mathcal{L}_X$ denotes our generalized directional derivative operator. The problem
with this definition is that it makes no sense, because $T_{\phi_t p}$ and $T_p$ live in different
(unrelated) vector spaces, so we cannot subtract them. But, if we were to drag $T_{\phi_t p}$
back to $p$ by means of the flow $\phi_t$, then we could compare its value with $T_p$.

36 It is a group of diffeomorphisms under composition, since $\phi_{s+t} = \phi_s\circ\phi_t = \phi_t\circ\phi_s$ with $\phi_{-t} = \phi_t^{-1}$ and
$\phi_0 = \mathrm{id}$.

The right way to proceed is to rewrite (3.96) as

$$(Xf)_p = \lim_{t\to 0}\frac{\phi_t^*f - f}{t}\bigg|_p, \tag{3.97}$$

for then

$$(Xf)_p = \lim_{t\to 0}\frac{f(\phi_t(p)) - f(p)}{t}$$

as before. By analogy, we define the Lie derivative $\mathcal{L}_X T$ of a tensor field $T$ in the
direction $X$ at $p$ by

$$(\mathcal{L}_X T)_p := \lim_{t\to 0}\frac{\phi_t^*T - T}{t}\bigg|_p = \frac{d}{dt}\,\phi_t^*T\,\bigg|_{t=0}, \tag{3.98}$$

where the pullback of a tensor field is defined by (3.90).37 Note that a scalar field
can be viewed as a type (0, 0) tensor field, so that (3.90) applied to a scalar field
just reduces to the ordinary pullback of functions. In particular, we have

$$(\mathcal{L}_X f)_p = (Xf)_p. \tag{3.99}$$

The following properties are immediate from the definition (3.98):

(1) $\mathcal{L}_X$ is linear;

(2) $\mathcal{L}_X$ preserves tensor type.

Also, the Lie derivative is, well, a derivative, so

(3) $\mathcal{L}_X$ is a derivation: $\mathcal{L}_X(S\otimes T) = \mathcal{L}_X S\otimes T + S\otimes\mathcal{L}_X T$.


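37 Other authors write this a little differently. For example, one equivalent way to write it would be

$$(\mathcal{L}_X T)_p = \lim_{t\to 0}\frac{\phi_{-t*}T - T}{t}\bigg|_p = \lim_{t\to 0}\frac{(\phi_{-t*}T)_p - T_p}{t},$$

although in the spirit of (3.83) it is sometimes written as

$$(\mathcal{L}_X T)_p = \lim_{t\to 0}\frac{\phi_{-t*}T_{\phi_t p} - T_p}{t}$$

instead, because one imagines the first term on the right as "pushing" the tensor field $T$ at $\phi_t p$ backwards
to $p$. Lastly, as $\phi_{0*} = \mathrm{id}$, the equation is sometimes written as

$$(\mathcal{L}_X T)_p = \lim_{t\to 0}\phi_{t*}\left(\frac{\phi_{-t*}T - T}{t}\right)\bigg|_p = \lim_{t\to 0}\frac{T - \phi_{t*}T}{t}\bigg|_p.$$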
Some properties that are not so obvious include the following.

(4) $\mathcal{L}_X$ is compatible with the natural pairing of forms and vector fields:

$$\mathcal{L}_X\langle\omega, Y\rangle = \langle\mathcal{L}_X\omega, Y\rangle + \langle\omega, \mathcal{L}_X Y\rangle. \tag{3.100}$$

(5) For any vector field $Y$,

$$\mathcal{L}_X Y = [X, Y]. \tag{3.101}$$

(6) For vector fields $X$, $Y$, and $Z$,

$$[\mathcal{L}_X, \mathcal{L}_Y]Z = \mathcal{L}_{[X,Y]}Z, \tag{3.102}$$

where $[\mathcal{L}_X, \mathcal{L}_Y] := \mathcal{L}_X\mathcal{L}_Y - \mathcal{L}_Y\mathcal{L}_X$.

We offer a direct proof of property (5) here, leaving a slightly different proof to
the following exercise. It suffices to show that $(\mathcal{L}_X Y)f = [X, Y]f$ for all smooth
functions $f$. Thus, letting $\phi_t$ be the flow associated to $X$ and using (3.86) in the
form

$$(\varphi_*Y)g = (\varphi^{-1})^*\,Y(\varphi^*g) \tag{3.103}$$

gives

$$\begin{aligned}
(\mathcal{L}_X Y)f &= \lim_{t\to 0}\frac{\phi_t^*Y - Y}{t}\,f \\
&= \lim_{t\to 0}\frac{\phi_{-t*}Y - Y}{t}\,f \\
&= \lim_{t\to 0}\phi_t^*\left(\frac{Y(\phi_{-t}^*f) - Yf}{t}\right) + \lim_{t\to 0}\frac{\phi_t^*(Yf) - Yf}{t} \\
&= Y(-Xf) + X(Yf) = [X, Y]f.
\end{aligned}$$

(We have used the facts that $\phi_0 = \mathrm{id}$ and that $\phi_{-t}$ is the flow associated to the
vector field $-X$.)
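Property (5) means that the Lie derivative of a vector field can be computed from the familiar coordinate expression for the bracket, $[X, Y]^i = X^j\,\partial_j Y^i - Y^j\,\partial_j X^i$. The sketch below evaluates that expression numerically. It is only an illustration: the central-difference step `h`, the function name `bracket`, and the two rotation fields on $\mathbb{R}^3$ are assumptions made for this example.

```python
def bracket(X, Y, p, h=1e-5):
    """Approximate [X, Y] at p via [X, Y]^i = X^j d_j Y^i - Y^j d_j X^i."""
    Xp, Yp = X(p), Y(p)

    def directional(F, v):
        # Central difference of the components of F at p along the vector v,
        # which approximates v^j d_j F^i.
        plus = F(tuple(pi + h * vi for pi, vi in zip(p, v)))
        minus = F(tuple(pi - h * vi for pi, vi in zip(p, v)))
        return tuple((a - b) / (2 * h) for a, b in zip(plus, minus))

    XY = directional(Y, Xp)   # X^j d_j Y^i
    YX = directional(X, Yp)   # Y^j d_j X^i
    return tuple(a - b for a, b in zip(XY, YX))


def rot_z(p):
    """Rotation about the z-axis: -y d/dx + x d/dy."""
    x, y, z = p
    return (-y, x, 0.0)


def rot_x(p):
    """Rotation about the x-axis: -z d/dy + y d/dz."""
    x, y, z = p
    return (0.0, -z, y)


# By hand, [rot_z, rot_x] = -z d/dx + x d/dz, i.e. (-3, 0, 1) at the point (1, 2, 3).
print(bracket(rot_z, rot_x, (1.0, 2.0, 3.0)))
```

Exchanging the two arguments flips the sign of the output, reflecting the antisymmetry $\mathcal{L}_X Y = -\mathcal{L}_Y X$.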


EXERCISE 3.37 Prove properties (4) through (6) above. Hints: For property (4),
first show that pullback commutes with contraction, in the sense that

$$\phi_t^*\langle\omega, Y\rangle = \langle\phi_t^*\omega, \phi_t^*Y\rangle, \tag{3.104}$$

where $\phi_t^*Y$ means $(\phi_t^{-1})_*Y$. For property (5), use (4) with $\omega = df$ and recall that $d$
commutes with pullback. For property (6), use (5) and the Jacobi identity.

Roughly speaking, the Lie derivative measures how the tensor changes as it
moves along the integral curves of X. It turns out to have many uses, but it suf- 
fers from one major problem, namely that it depends on X in a neighborhood of 
p and not simply on X at the single point p. This necessitates the introduction of 
additional derivative operators, as we shall see. 

Additional exercises 

NOTE: Henceforth we drop the boldface font for matrices whenever there is 
little or no risk of confusion between a matrix and its corresponding linear 
operator. 

3.38 The circle By analogy with Example 3.6, show that $S^1 = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 = 1\}$
is an orientable manifold. (No, you cannot cheat and use
Theorem 3.2.)

3.39 Cartesian products of manifolds Show that the Cartesian product $M\times N$
of two smooth manifolds $M$ and $N$ (of dimensions $m$ and $n$, respectively), is
naturally a smooth manifold of dimension $m + n$. Furthermore, if $M$ and $N$ are
both orientable, show that $M\times N$ is orientable. Hint: If $M$ has atlas $\{(U_i, \varphi_i)\}$
and $N$ has atlas $\{(V_j, \psi_j)\}$, show that $M\times N$ has atlas $\{(U_i\times V_j, \varphi_i\times\psi_j)\}$,
where $(\varphi_i\times\psi_j)(p, q) := (\varphi_i(p), \psi_j(q)) \in \mathbb{R}^m\times\mathbb{R}^n$. Remark: From this
result we immediately obtain that the $n$-torus, defined by

$$T^n = \underbrace{S^1\times\cdots\times S^1}_{n\ \text{times}},$$

is a smooth $n$-dimensional manifold, because the circle $S^1$ is a smooth
manifold.

3.40 Area forms on the 2-sphere The standard area element on the unit 2-sphere
$S^2$ is $\sigma = \sin\theta\,d\theta\wedge d\phi$ in spherical polar coordinates $(r, \theta, \phi)$. Let
$f : \mathbb{R}^3 - \{0\} \to S^2$ be the projection map $\mathbf{x} \mapsto \mathbf{x}/r$, where $r = (x^2 + y^2 + z^2)^{1/2}$.
(a) Show that

$$f^*\sigma = \frac{1}{r^3}\,(x\,dy\wedge dz + y\,dz\wedge dx + z\,dx\wedge dy).$$

(b) Show that, in terms of the stereographic coordinates $(u^1, u^2)$ and $(v^1, v^2)$
described in Example 3.6,

$$\sigma_U = -4\,\frac{du^1\wedge du^2}{(1 + \eta)^2} \qquad\text{and}\qquad \sigma_V = +4\,\frac{dv^1\wedge dv^2}{(1 + \xi)^2}.$$

Compare these forms with those of Example 3.10. Hint: Restrict $f^*\sigma$
from part (a) to the sphere, then use the coordinate transformation
between Cartesian and stereographic coordinates obtained in Example 3.6.
Alternatively, you could calculate the coordinate transformation
between polar and stereographic coordinates directly.

3.41 Elliptic curves The zero set T, of the polynomial 

$$f(x, y) = y^2 - x^3 - ax - b, \qquad a, b \in \mathbb{R},$$

in the plane is called an elliptic curve (over the reals) provided that it has
no cusps or self-intersections, i.e., provided that it is an embedded smooth
submanifold of $\mathbb{R}^2$. For what values of $a$ and $b$ does $f$ define a real elliptic
curve? Hint: Recall that a cubic polynomial of the form $x^3 + ax + b$ has
repeated roots only if the discriminant $\Delta = -16(4a^3 + 27b^2)$ vanishes.

3.42 Regular values and tangent spaces Let $f : M \to N$ be a smooth map
of manifolds with $m = \dim M > \dim N = n$ and let $q \in N$ be a regular
value of $f$. Let $X = f^{-1}(q)$ and let $p \in X$. Show that $T_pX$ is just the kernel
of the differential $f_* : T_pM \to T_qN$. Hint: $X$ is a submanifold of $M$, so
$T_pX$ is a subspace of $T_pM$. Consider $f_*$ restricted to $X$ and don't forget the
rank-nullity theorem.

3.43 Lie groups A Lie group $G$ is both a smooth manifold and a group such
that the group operations $(g, h) \mapsto gh$ and $g \mapsto g^{-1}$ are smooth maps.

(a) The set $M_n(\mathbb{R})$ of all real $n\times n$ matrices is easily seen to be an $n^2$-dimensional
smooth manifold isomorphic to $\mathbb{R}^{n^2}$. Each point is a matrix
$A$, and the components of the matrix serve as coordinates for the point.
In other words, the coordinate functions $\{x_{ij}\}$ are given by $x_{ij}(A) = A_{ij}$.
The general linear group $GL(n, \mathbb{R})$ is the subset of $M_n(\mathbb{R})$ consisting of
all matrices with nonzero determinant, the group operation being ordinary
matrix multiplication. Prove that $GL(n, \mathbb{R})$ is an $n^2$-dimensional
Lie group. Hint: Show that it is an open subset of $M_n(\mathbb{R})$ and that the
group operations are smooth. Remark: A matrix group is a subgroup of
$GL(n, \mathbb{R})$.

(b) Show that the set $O(n)$ of all orthogonal matrices is a Lie group. It is
called the orthogonal group. What is its dimension? Hint: Let $M_n^+(\mathbb{R})$
be the set of real symmetric $n\times n$ matrices, and consider the map
$\varphi : M_n(\mathbb{R}) \to M_n^+(\mathbb{R})$ given by $\varphi(A) = AA^T$. Use Theorem 3.2 with $q = I$.
The only tricky thing is to prove that $I$ is a regular value of $\varphi$. For this,
show that $\varphi_{*A}B = AB^T + BA^T$ and go from there. Note that, because
$M_n(\mathbb{R})$ and $M_n^+(\mathbb{R})$ are vector spaces, we have $T_pM_n(\mathbb{R}) = M_n(\mathbb{R})$ and
$T_qM_n^+(\mathbb{R}) = M_n^+(\mathbb{R})$.

(c) The orthogonal group is divided into two classes of matrices: those with
determinant $+1$ and those with determinant $-1$. The ones with determinant
$+1$ constitute the special orthogonal group, denoted $SO(n)$. Show
that $SO(n)$ is a Lie group but the other class of matrices is not. Hint: Use
Theorem 3.2 again, then recall (2.54).

(d) Show that $SO(2)$ is diffeomorphic to the circle (1-sphere) $S^1$. Hint: Con-



without proof. If $\varphi : M \to N$ is a smooth map of manifolds, and $M' \subset M$
and $N' \subset N$ are submanifolds with $\varphi(M') \subset N'$, then the restriction map
$\varphi|_{M'} : M' \to N'$ is smooth.

3.44 Lie algebras Let $G$ be a Lie group with identity element $e$. Define a map
(actually a diffeomorphism) $L_g : G \to G$ given by $L_g(h) = gh$ ("left multiplication
by $g$"). A vector field $X$ is left invariant if $L_{g*}X = X$ for all
$g \in G$ (which is a sloppy but convenient way to write $L_{g*}X_h = X_{gh}$ for
every $g, h \in G$).

(a) If $X$ and $Y$ are two left-invariant vector fields, show that their Lie bracket
$[X, Y]$ is also left-invariant.

(b) Show that a left-invariant vector field $X$ is determined uniquely by its
value $X_e$ at the identity and, moreover, given a vector $Y \in T_eG$, there
is a unique left-invariant vector field $X$ with $X_e = Y$. Thus, with the
obvious identification $[X_e, Y_e] := [X, Y]_e$, $T_eG$ inherits the structure
of a Lie algebra, usually denoted $\mathfrak{g}$, called the Lie algebra of the Lie
group $G$.38

(c) A one-parameter subgroup of $G$ is a smooth homomorphism $\gamma : \mathbb{R} \to G$,
written $t \mapsto \gamma(t)$. In other words, it's a smooth path in $G$ satisfying
$\gamma(s + t) = \gamma(s)\gamma(t)$. (Observe that this automatically implies that
$\gamma(0) = e$ and $\gamma(t)^{-1} = \gamma(-t)$. Why?) Show that there is a bijection
between the set of one-parameter subgroups of $G$ and the Lie algebra
$\mathfrak{g}$ of $G$. Hint: Consider the integral curves of a left invariant vector
field. To show that a homomorphism is obtained, consider the derivative
of $\gamma(s + t) = \gamma(s)\gamma(t)$ and use the uniqueness of integral curves.
You should also show that the domain of $\gamma$ can be extended to the
whole of $\mathbb{R}$ (from which we conclude that left invariant vector fields are
complete).

Remark: Let $\gamma_X(t)$ denote the one-parameter subgroup corresponding to
the left invariant vector field $X$. By analogy with the usual exponential
function, which satisfies $e^s e^t = e^{s+t}$, one writes $\gamma_X(t) = \operatorname{Exp} tX$.
The map $\operatorname{Exp} : \mathfrak{g} \to G$ given by $X \mapsto \operatorname{Exp} X = \gamma_X(1)$ is called the
exponential map.39 In general the exponential map is neither injective
nor surjective, but it is still quite powerful. It furnishes a link
between Lie groups and Lie algebras that enables one to reduce questions
about Lie groups to questions about Lie algebras. Specifically, there
is an almost unique correspondence between a Lie group and a Lie
algebra, and the exponential map provides a way of associating a Lie
group homomorphism to a Lie algebra homomorphism and vice versa.
For a nice overview, see Graeme Segal's chapter in [15] or, for more
details, [88].

38 The Lie algebra $\mathfrak{g}$ is therefore a $(\dim G)$-dimensional subalgebra of the (infinite-dimensional)
Lie algebra of all vector fields on $G$.

3.45 The Lie algebra $\mathfrak{gl}(n, \mathbb{R})$ A Lie algebra isomorphism $\varphi : \mathfrak{a} \to \mathfrak{b}$ is a vector
space isomorphism that is also an algebra homomorphism: $\varphi([X, Y]_{\mathfrak{a}}) = [\varphi(X), \varphi(Y)]_{\mathfrak{b}}$.
The Lie algebra of the general linear group $GL(n, \mathbb{R})$ is
denoted $\mathfrak{gl}(n, \mathbb{R})$. Show that there is a Lie algebra isomorphism $\mathfrak{gl}(n, \mathbb{R}) \to M_n(\mathbb{R})$,
where $M_n(\mathbb{R})$ is a Lie algebra under the bracket operation $[A, B] = AB - BA$
(the commutator of $A$ and $B$). Hint: A neat proof proceeds as follows.
(i) Let $x := (x_{ij})$ be the matrix of coordinate functions on $GL(n, \mathbb{R})$,
and define $\Theta := x^{-1}\,dx$, a matrix of 1-forms on $GL(n, \mathbb{R})$. (Incidentally, $\Theta$ is
called the Maurer-Cartan form of $GL(n, \mathbb{R})$.) (ii) Show that $\Theta$ is left invariant,
meaning that, for every $A \in GL(n, \mathbb{R})$, we have $L_A^*\Theta = \Theta$; thereby show
that each entry $\Theta_{ij}$ is left invariant. (iii) Let $X$ be a vector field on $GL(n, \mathbb{R})$
and define a map $X \mapsto B_X$, where $B_X := \langle\Theta, X\rangle$. (The latter notation means
that each entry of $\Theta$ is paired with $X$.) Show that if $X$ is left invariant then
this map induces a vector space isomorphism $\mathfrak{gl}(n, \mathbb{R}) \to M_n(\mathbb{R})$ by demonstrating
that $\Theta(X)$ is a constant on $GL(n, \mathbb{R})$. You will probably want to use
(3.104). (iv) Given $X, Y \in \mathfrak{gl}(n, \mathbb{R})$, use (3.123) to show that $d\Theta(X, Y) = -\Theta([X, Y])$.
(v) By differentiating, show that $d\Theta = -\Theta\wedge\Theta$. (The left-hand
side of this expression means you are to apply $d$ to each entry of $\Theta$. The
right-hand side of this expression is a matrix of 2-forms whose $ij$th entry is
$(\Theta\wedge\Theta)_{ij} = \Theta_{ik}\wedge\Theta_{kj}$. In particular, $\Theta\wedge\Theta$ does not vanish.) (vi) Use
(v) to evaluate $d\Theta(X, Y)$ and compare the result with (iv) to conclude that
$[X, Y] \mapsto [B_X, B_Y]$.

39 For more on the exponential map in the context of Riemannian manifolds, see Exercise 8.50 and
Appendix D.
3.46 The Lie algebra $\mathfrak{so}(n)$ Show that $\mathfrak{so}(n)$, the Lie algebra of $SO(n)$, can be
identified with the Lie subalgebra of $M_n(\mathbb{R})$ consisting of all $n\times n$ skew symmetric
matrices together with the commutator as bracket. (A Lie subalgebra
is just a subset of a Lie algebra that is itself a Lie algebra.) Hint: Consider the
tangent vector to a curve in $SO(n)$ at the identity $e$. Show that $\mathfrak{so}(n)$ is a Lie
subalgebra of $M_n(\mathbb{R})$.

3.47 The exponential map Show that, under the isomorphism $\mathfrak{gl}(n, \mathbb{R}) \cong M_n(\mathbb{R})$,
the exponential map $\operatorname{Exp} : \mathfrak{gl}(n, \mathbb{R}) \to GL(n, \mathbb{R})$ is given by

$$\operatorname{Exp} tA = e^{tA}, \tag{3.105}$$

where the latter expression is the ordinary exponential operator applied to the
matrix $tA$:

$$e^{tA} := \sum_{k=0}^{\infty}\frac{(tA)^k}{k!} = I + tA + \frac{1}{2!}t^2A^2 + \cdots. \tag{3.106}$$

Hint: The curve $\gamma(t) = \operatorname{Exp} tX$ is the unique integral curve associated to the
left-invariant vector field $X$, which means that $d\gamma/dt = X_{\gamma(t)} = L_{\gamma(t)*}X_e$.
$L_{A*}$ is just left multiplication by the matrix $A$. (Why?) Ignore questions of
convergence, which all work out in the end.
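The series (3.106) is easy to experiment with numerically. The following sketch is an illustration only: it truncates the series after a fixed number of terms (the cutoff `terms=40` is an arbitrary assumption that suffices for small matrices with modest entries), and the helper names are hypothetical. It takes $A$ to be the generator of rotations in the plane, for which $e^{tA}$ should be the rotation matrix through angle $t$, and checks the one-parameter subgroup law $e^{(s+t)A} = e^{sA}e^{tA}$.

```python
import math


def matmul(A, B):
    """Product of two square matrices stored as lists of rows."""
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]


def scale(c, A):
    """Scalar multiple c * A."""
    return [[c * x for x in row] for row in A]


def expm(A, terms=40):
    """Truncated power series sum_{k < terms} A^k / k!."""
    n = len(A)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]          # current term A^k / k!, starting at k = 0
    for k in range(1, terms):
        term = scale(1.0 / k, matmul(term, A))
        result = [[r + t for r, t in zip(rr, tr)] for rr, tr in zip(result, term)]
    return result


J = [[0.0, -1.0], [1.0, 0.0]]                  # a skew-symmetric generator

t = 0.7
print(expm(scale(t, J)))
print([[math.cos(t), -math.sin(t)], [math.sin(t), math.cos(t)]])

s = 0.4
print(matmul(expm(scale(s, J)), expm(scale(t, J))))
print(expm(scale(s + t, J)))
```

Each pair of printed matrices should agree to rounding error; the first pair also illustrates the identification of $SO(2)$ with the circle.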

3.48 The Lie algebra $\mathfrak{su}(n)$ All the previous results have corresponding complex
analogues. The Lie group $GL(n, \mathbb{C})$ is the set of all $n\times n$ matrices with
complex entries and nonzero determinant. It is a Lie group of real dimension
$2n^2$, being an open subset of $M_n(\mathbb{C}) \cong \mathbb{R}^{2n^2}$. The Lie algebra $\mathfrak{gl}(n, \mathbb{C})$ of
$GL(n, \mathbb{C})$ is isomorphic to $M_n(\mathbb{C})$, the
algebra of complex $n\times n$ matrices with commutator as bracket. The analogue
of the orthogonal group $O(n)$ is $U(n)$, the unitary group, consisting
of all $n\times n$ unitary matrices. (Recall that a matrix $A$ is unitary provided
that $AA^\dagger = I$, where $A^\dagger$ is the conjugate transpose of $A$ and $I$ is the identity
matrix.) The special unitary group $SU(n)$ comprises $n\times n$ unitary
matrices with determinant equal to unity. Show that the Lie algebra $\mathfrak{su}(n)$
of $SU(n)$ can be identified with the Lie subalgebra of $M_n(\mathbb{C})$ consisting of
all traceless $n\times n$ anti-Hermitian matrices. (A matrix $A$ is anti-Hermitian, or
skew-Hermitian, if $A^\dagger = -A$.) Hint: For the anti-Hermiticity requirement,
proceed by analogy with the method of solution of Exercise 3.46. To show
tracelessness, combine Exercises 3.47 and 1.48.

3.49 The adjoint representation of a Lie group Let $G$ be a group. A representation
of $G$ on a vector space $V$ is a homomorphism $G \to \operatorname{Aut} V$.
There is a natural representation of a Lie group $G$ on its own Lie algebra,
defined as follows. Given an element $g \in G$ we define the conjugation
action $\varphi(g) : G \to G$ by $\varphi(g)h = ghg^{-1}$. By the properties of Lie
groups this map is smooth, so we may consider its derivative at the identity
$\varphi(g)_{*,e} : T_eG \to T_eG$. Identifying $T_eG$ with the Lie algebra $\mathfrak{g}$ we get a
map $\operatorname{Ad} : G \to \operatorname{Aut}\mathfrak{g}$ given by $\operatorname{Ad} g = \varphi(g)_{*,e}$. By the chain rule this map
is a homomorphism, so it defines a representation of $G$ called the adjoint
representation.

(a) Show that $\operatorname{Ad} g$ really is an automorphism of $\mathfrak{g}$ by showing that it is
bijective and is an algebra homomorphism.

(b) Show that, when $G = GL(n, \mathbb{R})$, $A \in G$, and $B \in \mathfrak{gl}(n, \mathbb{R})$,

$$(\operatorname{Ad} A)(B) = ABA^{-1}, \tag{3.107}$$

where the product operation on the right-hand side is ordinary matrix
multiplication.

3.50 The adjoint representation of a Lie algebra Let $\mathfrak{g}$ be a Lie
algebra. A representation of $\mathfrak{g}$ on a vector space $V$ is a Lie
algebra homomorphism $\mathfrak{g} \to \operatorname{End} V$, where the Lie
algebra structure on $\operatorname{End} V$ is defined by the commutator. The
adjoint representation of $\mathfrak{g}$ is the map
$\operatorname{ad} : \mathfrak{g} \to \operatorname{End} \mathfrak{g}$ given by

    $$\operatorname{ad} X(Y) := [X, Y].$$    (3.108)

As the terminology suggests, there is an intimate connection between the
adjoint representation of a Lie group and that of its Lie algebra.
Specifically, we have

    $$\operatorname{Ad} \operatorname{Exp} X = \operatorname{Exp} \operatorname{ad} X.$$    (3.109)

The best way to understand this relation is to prove it for the special case
of a matrix group such as $GL(n, \mathbb{R})$. In that case, the exponential
map just becomes the ordinary matrix exponential, so (3.109) reads

    $$\operatorname{Ad} e^{X} = e^{\operatorname{ad} X}.$$    (3.110)

Prove (3.110). Hint: Replace $X$ by $tX$, apply both sides to a matrix $Y$ and
call the resulting equation (*). Differentiate (*) with respect to $t$ and show
that both sides of (*) satisfy the same first-order differential equation with
the same initial conditions. Invoke a uniqueness theorem. Remark: The two kinds
of adjoint map are affectionately known as big Ad and little ad, although you
should be aware that some authors write “ad” for what we have called “Ad”.
Caveat lector.
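A numerical sanity check can make (3.110) feel less abstract before attempting the proof. The following is only a sketch: the matrices `X` and `Y` are arbitrary illustrative choices, the helper names are hypothetical, and the exponentials are approximated by truncated power series, so agreement is expected only up to rounding error.

```python
# Sketch: compare (Ad e^X)(Y) = e^X Y e^{-X} with e^{ad X} Y for sample matrices.
# All names and the sample matrices are illustrative assumptions.

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b, s=1.0):
    """Return a + s*b entrywise."""
    return [[x + s * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def scale(a, c):
    return [[c * x for x in row] for row in a]

def identity(n):
    return [[1.0 if i == j else 0.0 for j in range(n)] for i in range(n)]

def expm(a, terms=40):
    """Matrix exponential via a truncated power series (fine for small norms)."""
    result = identity(len(a))
    term = identity(len(a))
    for k in range(1, terms):
        term = scale(matmul(term, a), 1.0 / k)   # term = a^k / k!
        result = add(result, term)
    return result

def commutator(a, b):
    return add(matmul(a, b), matmul(b, a), -1.0)

def big_ad(x, y):
    """(Ad e^X)(Y) = e^X Y e^{-X}, using (3.107)."""
    return matmul(matmul(expm(x), y), expm(scale(x, -1.0)))

def exp_little_ad(x, y, terms=40):
    """e^{ad X} Y = Y + [X, Y] + [X, [X, Y]]/2! + ..."""
    result = y
    term = y
    for k in range(1, terms):
        term = scale(commutator(x, term), 1.0 / k)   # term = (ad X)^k Y / k!
        result = add(result, term)
    return result

X = [[0.0, 0.3, -0.2], [0.1, 0.0, 0.4], [0.5, -0.1, 0.2]]
Y = [[1.0, 2.0, 0.0], [0.0, -1.0, 1.0], [3.0, 0.0, 0.5]]

lhs = big_ad(X, Y)
rhs = exp_little_ad(X, Y)
max_diff = max(abs(l - r) for row_l, row_r in zip(lhs, rhs)
               for l, r in zip(row_l, row_r))
print("largest entrywise difference:", max_diff)
```

The check is no substitute for the uniqueness argument in the hint, but it does confirm that the two sides are being compared with consistent conventions.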

3.51 Fun with matrix exponentials Show that if $A, B \in M_n(\mathbb{R})$ and
$[A, [A, B]] = [B, [A, B]] = 0$ then

    $$e^{A} e^{B} = e^{A + B + (1/2)[A, B]}.$$    (3.111)

In particular, it follows that

    $$e^{A} e^{B} = e^{A + B}$$    (3.112)

if $A$ and $B$ commute. Hint: Write

    $$e^{t(A+B)} = e^{tA} f(t),$$    (3.113)

then show that

    $$f(t) = e^{tB} e^{-(t^2/2)[A, B]}$$    (3.114)

by differentiating (3.113) and demonstrating that both sides of (3.114) obey
the same differential equation with the same initial conditions. Remark: In
general, if $e^{A} e^{B} = e^{C}$, the Baker-Campbell-Hausdorff formula gives a
complicated expression for $C$ in terms of $A$, $B$, and higher-order
commutators. Its utility is that it permits one to recover the Lie group
locally from its Lie algebra. For more details, see e.g. [34].
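Formula (3.111) can be checked exactly on a small example. The sketch below assumes a particular pair of strictly upper triangular 3 x 3 matrices (an illustrative choice, not taken from the text) whose commutator commutes with both of them; because such matrices are nilpotent, the exponential series terminates and exact rational arithmetic can be used.

```python
# Sketch: exact check of e^A e^B = e^{A + B + [A,B]/2} for nilpotent A, B.
# The matrices and helper names are illustrative assumptions.
from fractions import Fraction

def frac_matrix(rows):
    return [[Fraction(x) for x in row] for row in rows]

def matmul(a, b):
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def add(a, b, s=1):
    """Return a + s*b entrywise."""
    return [[x + s * y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def commutator(a, b):
    return add(matmul(a, b), matmul(b, a), -1)

def expm_nilpotent(a):
    """Exact exponential of a strictly upper triangular n x n matrix.

    Such a matrix satisfies a^n = 0, so the series has at most n nonzero terms.
    """
    n = len(a)
    result = frac_matrix([[1 if i == j else 0 for j in range(n)] for i in range(n)])
    term = result
    for k in range(1, n + 1):
        term = [[x / k for x in row] for row in matmul(term, a)]   # a^k / k!
        result = add(result, term)
    return result

A = frac_matrix([[0, 1, 0], [0, 0, 0], [0, 0, 0]])
B = frac_matrix([[0, 0, 0], [0, 0, 1], [0, 0, 0]])
C = commutator(A, B)

zero = frac_matrix([[0] * 3 for _ in range(3)])
hypotheses_hold = commutator(A, C) == zero and commutator(B, C) == zero

lhs = matmul(expm_nilpotent(A), expm_nilpotent(B))
half_C = [[x / 2 for x in row] for row in C]
rhs = expm_nilpotent(add(add(A, B), half_C))
naive = expm_nilpotent(add(A, B))   # what (3.112) would give if A and B commuted

print(hypotheses_hold, lhs == rhs, lhs == naive)
```

Here `A` and `B` do not commute, so the last comparison fails while the one with the commutator correction succeeds.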

3.52 The Killing form of a Lie algebra Every Lie algebra $\mathfrak{g}$ comes
equipped with a natural symmetric bilinear form $(\cdot, \cdot)$, called the
Killing form, given by

    $$(X, Y) := \operatorname{tr}(\operatorname{ad} X \circ \operatorname{ad} Y).$$    (3.115)

If the Killing form is nondegenerate, $\mathfrak{g}$ is said to be semisimple.
Show that the Killing form satisfies the following properties. (The idea is to
demonstrate the properties directly from the definitions. For half credit, you
may assume $\mathfrak{g}$ to be a subalgebra of $\mathfrak{gl}(n, \mathbb{R})$,
i.e., an algebra of matrices.)

(a) It is symmetric, so that

    $$(X, Y) = (Y, X).$$    (3.116)

(b) It is invariant under the natural adjoint action:

    $$(\operatorname{Ad} g(X), \operatorname{Ad} g(Y)) = (X, Y).$$    (3.117)

Hint: Show that, for any automorphism $T$ of $\mathfrak{g}$,
$\operatorname{ad}(TX) = T \circ \operatorname{ad} X \circ T^{-1}$,
and use Exercise 3.49(a).

(c) For every $Z$,

    $$(\operatorname{ad} Z(X), Y) = -(X, \operatorname{ad} Z(Y)).$$    (3.118)


Hint: Show that $\operatorname{ad}[X, Y] = [\operatorname{ad} X, \operatorname{ad} Y]$.

3.53 The Lie algebra $\mathfrak{so}(3)$ By the result of Exercise 3.46, the Lie
algebra $\mathfrak{so}(3)$ may be viewed as the vector space of all
$3 \times 3$ skew symmetric matrices, so one possible basis is

    $$E_1 := \begin{pmatrix} 0 & 1 & 0 \\ -1 & 0 & 0 \\ 0 & 0 & 0 \end{pmatrix}, \quad
      E_2 := \begin{pmatrix} 0 & 0 & -1 \\ 0 & 0 & 0 \\ 1 & 0 & 0 \end{pmatrix}, \quad \text{and} \quad
      E_3 := \begin{pmatrix} 0 & 0 & 0 \\ 0 & 0 & 1 \\ 0 & -1 & 0 \end{pmatrix}.$$

Evidently, $\mathfrak{so}(3)$ is three dimensional.

(a) By explicit computation, show that

    $$[E_i, E_j] = \sum_k \varepsilon_{ijk} E_k,$$

where $\varepsilon_{ijk}$ is the sign of the permutation $(i, j, k)$. Remark:
The numbers $\varepsilon_{ijk}$ are called the structure constants of the Lie
algebra $\mathfrak{so}(3)$ relative to the given basis. Note that the structure
constants also give the matrix elements of the adjoint representation of
$\mathfrak{so}(3)$, relative to the same basis.

(b) Use the result of part (a) to compute the entries $g_{ij} := (E_i, E_j)$ of
the Killing form and thereby show that it defines a nondegenerate inner
product on $\mathfrak{so}(3)$. What is its signature? Hint: Although it amounts
to overkill here, you could use the tensor identity (8.139).
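For readers who want to check their hand computation, the structure constants and the Killing form can be tabulated mechanically. This is a minimal sketch with hypothetical helper names; indices run over 0, 1, 2 rather than 1, 2, 3, and the matrix of $\operatorname{ad} E_i$ is assumed to have entry $\varepsilon_{ijk}$ in row $k$, column $j$, as suggested by the remark in part (a).

```python
# Sketch: structure constants and Killing form of so(3) in the basis E_1, E_2, E_3.
# Helper names are illustrative; indices are shifted to 0, 1, 2.

E = [
    [[0, 1, 0], [-1, 0, 0], [0, 0, 0]],
    [[0, 0, -1], [0, 0, 0], [1, 0, 0]],
    [[0, 0, 0], [0, 0, 1], [0, -1, 0]],
]

def epsilon(i, j, k):
    """Sign of the permutation (i, j, k) of (0, 1, 2); zero if an index repeats."""
    return (i - j) * (j - k) * (k - i) // 2

def matmul(a, b):
    return [[sum(a[r][m] * b[m][c] for m in range(3)) for c in range(3)]
            for r in range(3)]

def bracket(a, b):
    ab, ba = matmul(a, b), matmul(b, a)
    return [[ab[r][c] - ba[r][c] for c in range(3)] for r in range(3)]

def structure_sum(i, j):
    """The matrix sum_k epsilon_{ijk} E_k."""
    return [[sum(epsilon(i, j, k) * E[k][r][c] for k in range(3))
             for c in range(3)] for r in range(3)]

# Part (a): compare the matrix commutators with the structure constants.
brackets_ok = all(bracket(E[i], E[j]) == structure_sum(i, j)
                  for i in range(3) for j in range(3))

def ad_matrix(i):
    """Matrix of ad E_i: column j holds the components of [E_i, E_j]."""
    return [[epsilon(i, j, k) for j in range(3)] for k in range(3)]

def killing(i, j):
    """g_ij = tr(ad E_i o ad E_j)."""
    a, b = ad_matrix(i), ad_matrix(j)
    return sum(a[k][l] * b[l][k] for k in range(3) for l in range(3))

g = [[killing(i, j) for j in range(3)] for i in range(3)]
print(brackets_ok)
for row in g:
    print(row)
```

The printed matrix is diagonal with equal nonzero entries, from which nondegeneracy and the signature can be read off directly.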

3.54 Isomorphism between $\mathfrak{so}(3)$ and $\mathfrak{su}(2)$ Show that
$\mathfrak{so}(3)$ and $\mathfrak{su}(2)$ are isomorphic Lie algebras. Hint:
Show that one basis for $\mathfrak{su}(2)$ consists of the set
$\tau_k := (-i/2)\sigma_k$, $k = 1, 2, 3$, where $i = \sqrt{-1}$ and the
$\sigma_k$ are the Pauli matrices:

    $$\sigma_1 := \begin{pmatrix} 0 & 1 \\ 1 & 0 \end{pmatrix}, \quad
      \sigma_2 := \begin{pmatrix} 0 & -i \\ i & 0 \end{pmatrix}, \quad \text{and} \quad
      \sigma_3 := \begin{pmatrix} 1 & 0 \\ 0 & -1 \end{pmatrix}.$$

Consider the map $\phi : \mathfrak{su}(2) \to \mathfrak{so}(3)$ given by
$\tau_i \mapsto E_i$ extended by linearity.

Remark: Lie algebras give local but not global information about Lie
groups, so the fact that $\mathfrak{so}(3)$ and $\mathfrak{su}(2)$ are
isomorphic tells us only that $SO(3)$ and $SU(2)$ are locally isomorphic.
Actually, $SU(2)$ is the universal (double) cover of $SO(3)$. (A Lie group $G$
is a covering space (or simply a cover) of another Lie group $H$ if there
exists a smooth surjective group homomorphism $\varphi : G \to H$. It is
universal provided it is simply connected. A double cover is a cover whose
fibers $\varphi^{-1}(p)$ each consist of a pair of points.) This mathematical
fact about $SU(2)$ and $SO(3)$ has physical implications. In quantum theory a
spin-1/2 particle is represented by a (Pauli) spinor, namely an element of a
two-complex-dimensional vector space that transforms under rotations according
to an element of $SU(2)$. Specifically, if $\hat{n}$ denotes a unit vector in
$\mathbb{R}^3$ and $\theta$ the angle of rotation about the $\hat{n}$ axis then
the corresponding element of $SU(2)$ is
$R := \cos(\theta/2) I - i \sin(\theta/2)(\hat{n} \cdot \sigma)$,
where $I$ is the identity matrix. After one full rotation through an angle
$2\pi$, a spinor is transformed to the negative of itself, unlike an ordinary
vector, which always returns to its original value after a $2\pi$ rotation;
this property has observable consequences. Mathematically, the preimage under
$\varphi : SU(2) \to SO(3)$ of the identity in $SO(3)$ consists of the two maps
id and $-$id in $SU(2)$, as you can see by substituting $\theta = 2\pi$ and
$\theta = 4\pi$ into $R$. It is not difficult to show mathematically that
$SU(2)$ is simply connected, but there are also a few amusing physical models
that illustrate the same thing. Look up Feynman plate trick, Dirac belt,
spinor spanner, or Balinese candle dance.
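The sign flip under a $2\pi$ rotation is easy to see numerically. The sketch below evaluates $R = \cos(\theta/2) I - i \sin(\theta/2)(\hat{n} \cdot \sigma)$ for an arbitrarily chosen unit vector (an illustrative assumption) at $\theta = 2\pi$ and $\theta = 4\pi$.

```python
# Sketch: the SU(2) element R(theta) for a rotation about a unit vector n.
# The axis n below is an arbitrary illustrative choice.
import math

SIGMA = [
    [[0, 1], [1, 0]],
    [[0, -1j], [1j, 0]],
    [[1, 0], [0, -1]],
]

def rotation(theta, n):
    """R = cos(theta/2) I - i sin(theta/2) (n . sigma), for a unit vector n."""
    c, s = math.cos(theta / 2), math.sin(theta / 2)
    n_dot_sigma = [[sum(n[k] * SIGMA[k][r][col] for k in range(3))
                    for col in range(2)] for r in range(2)]
    return [[c * (1 if r == col else 0) - 1j * s * n_dot_sigma[r][col]
             for col in range(2)] for r in range(2)]

def distance(a, b):
    """Largest entrywise difference between two 2 x 2 matrices."""
    return max(abs(a[r][c] - b[r][c]) for r in range(2) for c in range(2))

n = (1 / 3, 2 / 3, 2 / 3)            # unit vector: 1/9 + 4/9 + 4/9 = 1
identity = [[1, 0], [0, 1]]
minus_identity = [[-1, 0], [0, -1]]

R_2pi = rotation(2 * math.pi, n)
R_4pi = rotation(4 * math.pi, n)
print(distance(R_2pi, minus_identity), distance(R_4pi, identity))
```

Both printed distances are at the level of floating-point rounding, reflecting that $R(2\pi) = -I$ and $R(4\pi) = I$ regardless of the axis.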

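3.55 Lie derivatives of forms Let $\omega$ be a $k$-form and let $X$, $Y$, and
$\{X_i\}$, $i = 1, \dots, k$ be vector fields. Prove the following formulae
involving the Lie derivative. Hint for all parts: Use induction.

(a) (Cartan’s formula)

    $$\mathcal{L}_X \omega = (i_X d + d\, i_X)\omega.$$    (3.119)

(b) (Another formula of Cartan)

    $$\mathcal{L}_X i_Y \omega - i_Y \mathcal{L}_X \omega = i_{\mathcal{L}_X Y} \omega.$$    (3.120)

(c) (A useful identity)

    $$(\mathcal{L}_{X_0} \omega)(X_1, \dots, X_k) = \mathcal{L}_{X_0}(\omega(X_1, \dots, X_k))
      - \sum_{i=1}^{k} \omega(X_1, \dots, \mathcal{L}_{X_0} X_i, \dots, X_k).$$    (3.121)

Extra hint: Use the previous part.

(d) (A relation between the exterior derivative and the Lie derivative)

    $$\begin{aligned}
    d\omega(X_0, X_1, \dots, X_k)
      &= \sum_{i=0}^{k} (-1)^i \mathcal{L}_{X_i}(\omega(X_0, \dots, \widehat{X_i}, \dots, X_k)) \\
      &\quad + \sum_{0 \le i < j \le k} (-1)^{i+j} \omega(\mathcal{L}_{X_i} X_j, X_0, \dots, \widehat{X_i}, \dots, \widehat{X_j}, \dots, X_k) \\
      &= \sum_{i=0}^{k} (-1)^i X_i(\omega(X_0, \dots, \widehat{X_i}, \dots, X_k)) \\
      &\quad + \sum_{0 \le i < j \le k} (-1)^{i+j} \omega([X_i, X_j], X_0, \dots, \widehat{X_i}, \dots, \widehat{X_j}, \dots, X_k),
    \end{aligned}$$    (3.122)

where a caret indicates that the argument beneath it is omitted. In
particular, when $k = 1$ we get the very useful equation

    $$d\omega(X, Y) = X\omega(Y) - Y\omega(X) - \omega([X, Y]).$$    (3.123)

Extra hint: Use parts (a) and (c).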
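A small worked instance shows why the commutator term in (3.123) cannot be dropped. The vector fields and the 1-form below are chosen purely for illustration and are not taken from the text.

```latex
% Illustrative choice on R^2:  X = \partial_x,  Y = x\,\partial_y,  \omega = dy.
\begin{align*}
[X, Y] &= [\partial_x,\, x\,\partial_y] = \partial_y,
  \qquad d\omega = d(dy) = 0, \\
X\,\omega(Y) &= \partial_x(x) = 1, \qquad
Y\,\omega(X) = x\,\partial_y(0) = 0, \qquad
\omega([X, Y]) = dy(\partial_y) = 1, \\
X\,\omega(Y) - Y\,\omega(X) - \omega([X, Y]) &= 1 - 0 - 1 = 0 = d\omega(X, Y).
\end{align*}
```

Without the last term the right-hand side would equal 1, which would contradict $d\omega = 0$.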

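3.56 Lie derivatives of tensor fields Show that, in a coordinate basis, the
Lie derivative along $X = X^i \partial_i$ of a tensor field $T$ of type
$(r, s)$ has components

    $$\begin{aligned}
    (\mathcal{L}_X T)^{i_1 \dots i_r}{}_{j_1 \dots j_s}
      &= T^{i_1 \dots i_r}{}_{j_1 \dots j_s, k} X^k \\
      &\quad - T^{k i_2 \dots i_r}{}_{j_1 \dots j_s} X^{i_1}{}_{,k}
             - T^{i_1 k \dots i_r}{}_{j_1 \dots j_s} X^{i_2}{}_{,k}
             - \cdots
             - T^{i_1 \dots i_{r-1} k}{}_{j_1 \dots j_s} X^{i_r}{}_{,k} \\
      &\quad + T^{i_1 \dots i_r}{}_{k j_2 \dots j_s} X^{k}{}_{,j_1}
             + T^{i_1 \dots i_r}{}_{j_1 k \dots j_s} X^{k}{}_{,j_2}
             + \cdots
             + T^{i_1 \dots i_r}{}_{j_1 \dots j_{s-1} k} X^{k}{}_{,j_s},
    \end{aligned}$$    (3.124)

where the comma followed by $i$ in $f_{,i}$ is shorthand for a coordinate
derivative $\partial f/\partial x^i$. Hint: Recall from (3.26) that

    $$\mathcal{L}_X Y = (X^i Y^k{}_{,i} - Y^i X^k{}_{,i})\, \partial_k.$$    (3.125)

Use the compatibility of the Lie derivative with dual pairing (contraction) to
show that, for a 1-form $\alpha = \alpha_i\, dx^i$,

    $$\mathcal{L}_X \alpha = (X^i \alpha_{k,i} + \alpha_i X^i{}_{,k})\, dx^k,$$    (3.126)

then use the derivation property.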

3.57 Symplectic forms and classical mechanics A smooth $2n$-dimensional
manifold $M$ equipped with a global nondegenerate 2-form $\omega$ on $M$
satisfying $d\omega = 0$ is called a symplectic manifold, and $\omega$ is
called the symplectic form. (As usual, $\omega$ is nondegenerate if
$\omega(X, Y) = 0$ for all $Y$ implies that $X = 0$.) A famous theorem of
Darboux from 1884 guarantees that, on such a manifold, we can always find local
coordinates $q^1, \dots, q^n, p_1, \dots, p_n$ such that

    $$\omega = \sum_{i=1}^{n} dq^i \wedge dp_i.$$

Symplectic manifolds are the natural setting for classical mechanics.

(a) Compute $\omega^n$ (meaning the wedge product of $\omega$ with itself $n$
times) explicitly and thereby demonstrate that it is a volume form on $M$,
namely a global nowhere-vanishing form of degree $2n = \dim M$. In particular,
this shows that $M$ is always orientable. (See Appendix C.)

(b) If $M$ and $N$ are symplectic manifolds with symplectic forms $\mu$ and
$\nu$, respectively, then a smooth map $f : M \to N$ is called symplectic (or,
in physics language, a canonical transformation) if $f^*\nu = \mu$. Show that
$f$ is necessarily a local diffeomorphism. For this reason, $f$ is often called
a (local) symplectomorphism. Hint: Show that $f$ preserves the volume
form $\omega^n$ and use the inverse function theorem.

(c) Let $M$ be symplectic with symplectic form $\omega$, and suppose that
$f : M \to \mathbb{R}$ is a smooth function. As $\omega$ is nondegenerate,
there exists a unique vector field $X_f$ associated to $f$ according to

    $$i_{X_f}\omega = \omega(X_f, \cdot) = df.$$

The field $X_f$ is called the Hamiltonian vector field associated to the
function $f$. Show that, in local coordinates,

    $$X_f = \sum_{i=1}^{n} \left( \frac{\partial f}{\partial p_i} \frac{\partial}{\partial q^i}
          - \frac{\partial f}{\partial q^i} \frac{\partial}{\partial p_i} \right).$$

(d) If $M$ is the phase space of some dynamical system then there is a
distinguished function $H$ on $M$, called the Hamiltonian of the system. Show
that $\gamma(t) = (q^1(t), \dots, q^n(t), p_1(t), \dots, p_n(t))$ is an
integral curve of $X_H$ if Hamilton’s equations hold. They are

    $$\dot{q}^i = \frac{\partial H}{\partial p_i}, \qquad
      \dot{p}_i = -\frac{\partial H}{\partial q^i}, \qquad i = 1, \dots, n,$$

where $\dot{q} := dq/dt$.

(e) Show that if $\gamma(t)$ is an integral curve for $X_H$ then
$dH(\gamma(t))/dt = 0$ (i.e., the Hamiltonian is constant along the curve).
When $H$ is the energy of the system this is just the statement that energy is
conserved.

(f) Let $\varphi_t$ be the flow of $X_H$. Show that $\varphi_t$ is a
symplectomorphism. Hint: Use Cartan’s formula to show that
$\mathcal{L}_{X_H}\omega = 0$. Remark: As the volume form is preserved by
$\varphi_t$, this proves Liouville’s theorem, which says that canonical
transformations preserve volume in phase space.

(g) The Poisson bracket of two functions $f$ and $g$ on $M$ is defined by

    $$\{f, g\} := \omega(X_f, X_g).$$

Show that, in local coordinates,

    $$\{f, g\} = \sum_{i=1}^{n} \left( \frac{\partial f}{\partial q^i} \frac{\partial g}{\partial p_i}
          - \frac{\partial g}{\partial q^i} \frac{\partial f}{\partial p_i} \right).$$

(h) Show that the smooth functions on $M$ equipped with the Poisson bracket
form a Lie algebra. Hint: Clearly the Poisson bracket is bilinear and skew
symmetric, so the only nontrivial part of the assertion is that the Poisson
bracket obeys the Jacobi identity

    $$\{f, \{g, h\}\} + \{g, \{h, f\}\} + \{h, \{f, g\}\} = 0.$$

This is a rather unpleasant computation if local coordinates are used.
Instead, compute $d\omega(X_f, X_g, X_h)$ using (3.122). You will get two types
of terms, which you need to simplify. For the terms of one type, the identity
$\{f, g\} = -\mathcal{L}_{X_f} g$ (which you should prove) comes in handy. For
terms of the second type, Cartan’s formulas are useful.
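The coordinate expression for the Poisson bracket in part (g) lends itself to a quick numerical experiment. The sketch below works on a two-dimensional phase space with coordinates $(q, p)$, approximates partial derivatives by central differences, and tests the canonical relation $\{q, p\} = 1$ and the Jacobi identity at one point. The sample functions, the evaluation point, and the step size are all illustrative assumptions, and the Jacobi sum vanishes only up to discretization error.

```python
# Sketch: finite-difference Poisson bracket on R^2 with coordinates (q, p).
# Sample functions, evaluation point, and step size are illustrative assumptions.

def partial(f, index, h=1e-3):
    """Central-difference approximation to df/dq (index 0) or df/dp (index 1)."""
    def df(q, p):
        if index == 0:
            return (f(q + h, p) - f(q - h, p)) / (2 * h)
        return (f(q, p + h) - f(q, p - h)) / (2 * h)
    return df

def poisson(f, g):
    """{f, g} = (df/dq)(dg/dp) - (dg/dq)(df/dp), returned as a function of (q, p)."""
    fq, fp = partial(f, 0), partial(f, 1)
    gq, gp = partial(g, 0), partial(g, 1)
    return lambda q, p: fq(q, p) * gp(q, p) - gq(q, p) * fp(q, p)

def coord_q(q, p):
    return q

def coord_p(q, p):
    return p

def f1(q, p):
    return q * q * p

def f2(q, p):
    return p * p + q

def f3(q, p):
    return q * p + p ** 3

q0, p0 = 0.7, -0.4

canonical = poisson(coord_q, coord_p)(q0, p0)        # should be close to 1
antisymmetry = poisson(f1, f2)(q0, p0) + poisson(f2, f1)(q0, p0)
jacobi = (poisson(f1, poisson(f2, f3))(q0, p0)
          + poisson(f2, poisson(f3, f1))(q0, p0)
          + poisson(f3, poisson(f1, f2))(q0, p0))

print(canonical, antisymmetry, jacobi)
```

A numerical check at a single point proves nothing, of course; its value is in catching sign errors in the coordinate formula before starting the coordinate-free argument suggested in part (h).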


4 

Homotopy and de Rham cohomology 


The most intuitively evident topological invariant of a space is the num- 
ber of connected pieces into which it falls. Over the past one hundred 
years or so we have come to realize that this primitive notion admits in 
some sense two higher-dimensional analogues. These are the homotopy 
and cohomology groups of the space in question. 

Raoul Bott and Loring Tu†

In physics one learns that a static electric field $E$ is conservative, or curl
free:

    $$\nabla \times E = 0,$$    (4.1)

where $\nabla$ is the usual gradient operator
$(\partial/\partial x, \partial/\partial y, \partial/\partial z)$. From this,
one concludes that

    $$E = \nabla \Phi$$    (4.2)

for some scalar field $\Phi$. In fact, there is an explicit formula for $\Phi$,
namely the line integral

    $$\Phi = \int E \cdot ds.$$    (4.3)

In terms of differential forms, we can define a 1-form
$E = E_x\, dx + E_y\, dy + E_z\, dz$, so that (4.1) is just the condition
$dE = 0$ while (4.2) is the condition $E = d\Phi$ for a smooth function
$\Phi$. Obviously $E = d\Phi$ implies $dE = 0$, because mixed partials commute
($d^2 = 0$). This example shows, however, that at least in this case the
converse is also true: $dE = 0$ implies $E = d\Phi$ for some $\Phi$.

We now proceed to generalize this example. Let $\omega$ be a $k$-form on a
smooth manifold $M$. If $d\omega = 0$ we say that $\omega$ is closed, while if
$\omega = d\alpha$ for some $(k-1)$-form $\alpha$ we say that $\omega$ is
exact. Every exact form is closed, because $d^2 = 0$. The question arises,
though, whether every closed form is exact. In the above example we saw that,
on $\mathbb{R}^3$ at least, every closed 1-form is indeed exact. It turns out
that a similar statement holds for a form of any degree on $\mathbb{R}^n$, and
this is the content of the Poincaré lemma (Theorem 4.1). But the corresponding
statement is false for a general manifold, for subtle reasons having to do with
the topology of $M$.

† Excerpt from [12], p. 1. Reprinted by permission.

To get a feeling for what is involved, consider the following 1-form on
$\mathbb{R}^2 - \{0\}$:

    $$\omega = \left( \frac{-y}{x^2 + y^2} \right) dx + \left( \frac{x}{x^2 + y^2} \right) dy.$$

This form has the interesting property that it is closed but not exact. That
is, $d\omega = 0$ (check!) but there is no function $f$ such that
$\omega = df$. How do we know this? Well, just integrate. If there were such a
function, we would have

    $$\frac{\partial f}{\partial x} = \frac{-y}{x^2 + y^2}, \qquad
      \frac{\partial f}{\partial y} = \frac{x}{x^2 + y^2}.$$

Solving these equations for $f$ gives

    $$f(x, y) = \tan^{-1}\left( \frac{y}{x} \right).$$

A simple calculation shows that $\omega = df$, which appears to contradict our
previous assertion. But in fact, there is no contradiction, because $f$ is not
a function! For example, $f(1, 1) = \pi/4, 9\pi/4, 17\pi/4, \dots$, so it is
not single-valued.¹

Notice that the example only works because we have removed the origin;
$\omega$ would not be well defined otherwise. Having removed the origin, the
problem is then that $f$ is not single-valued, because going around the origin
on a circle, or more generally on some simple closed curve, brings us back to a
different value for $f$. Somehow the existence of a form that is closed but not
exact in the above example seems to be due to the fact that the space
$\mathbb{R}^2 - \{0\}$ has a hole in it. The content of the Poincaré lemma is
that this never happens on $\mathbb{R}^n$ because $\mathbb{R}^n$ has no holes;
it is topologically trivial.
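The failure of exactness can be seen numerically: if $\omega$ were $df$ for a single-valued $f$, its integral around any closed curve would vanish. The following sketch approximates the line integral of $\omega = (-y\, dx + x\, dy)/(x^2 + y^2)$ around two circles by a midpoint rule on an inscribed polygon. The function names, the circles, and the number of steps are illustrative assumptions.

```python
# Sketch: line integral of omega = (-y dx + x dy) / (x^2 + y^2) around a circle.
# Names, radii, centers, and step counts are illustrative assumptions.
import math

def omega(x, y, dx, dy):
    """Evaluate omega at the point (x, y) on the displacement (dx, dy)."""
    return (-y * dx + x * dy) / (x * x + y * y)

def loop_integral(center=(0.0, 0.0), radius=1.0, steps=10000):
    """Midpoint-rule approximation of the integral of omega around a circle."""
    cx, cy = center
    total = 0.0
    for k in range(steps):
        t0 = 2 * math.pi * k / steps
        t1 = 2 * math.pi * (k + 1) / steps
        x0, y0 = cx + radius * math.cos(t0), cy + radius * math.sin(t0)
        x1, y1 = cx + radius * math.cos(t1), cy + radius * math.sin(t1)
        total += omega((x0 + x1) / 2, (y0 + y1) / 2, x1 - x0, y1 - y0)
    return total

around_origin = loop_integral(center=(0.0, 0.0), radius=1.0)
away_from_origin = loop_integral(center=(3.0, 0.0), radius=1.0)
print(around_origin, 2 * math.pi, away_from_origin)
```

The circle that encloses the origin yields approximately $2\pi$, while the circle that does not yields approximately zero, in line with the remark that the obstruction comes from the hole.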


4.1 Homotopy 

To make this precise, we need some definitions. Let $X$ and $Y$ be topological
spaces, and let $I = [0, 1] \subset \mathbb{R}$. Two continuous maps
$f, g : X \to Y$ are said to be homotopic, written $f \sim g$, if there exists
a continuous map $F : I \times X \to Y$ (called a homotopy) such that

    $$F(0, x) = f(x),$$    (4.4)

    $$F(1, x) = g(x).$$    (4.5)

If there exist continuous maps $f : X \to Y$ and $g : Y \to X$ such that
$g \circ f \sim \mathrm{id}_X$ and $f \circ g \sim \mathrm{id}_Y$, where id
denotes the identity map, then $X$ and $Y$ have the same homotopy type (or, in
an abuse of terminology, are homotopic), written $X \sim Y$. Intuitively,
homotopic spaces can be continuously deformed into one another.

Figure 4.1 A representation of the homotopy in Example 4.1 between a cylinder
and a disk (panels at $t = 1$, $t = 2/3$, $t = 1/3$, and $t = 0$).

¹ Technically, $f$ is a multivalued function.

Example 4.1 (The cylinder construction) Let $D$ be a disk in $\mathbb{R}^2$.
Then the cylinder $I \times D$ is homotopic to $D$. Let $f : I \times D \to D$
be given by $(s, y) \mapsto y$, and let $g : D \to I \times D$ be given by
$y \mapsto (0, y)$. Then $f \circ g = \mathrm{id}_D$, while
$g \circ f \sim \mathrm{id}_{I \times D}$ via the homotopy
$F(t, (s, y)) = (ts, y)$. Intuitively, the cylinder can be squashed onto the
disk in a continuous fashion. (See Figure 4.1.) Of course, the same thing
happens if we start with any space $X$ in place of the disk $D$.

A space $X$ is contractible if it is homotopic to a point. In that case, there
exist continuous maps $f : X \to \{p\}$ and $g : \{p\} \to X$ such that
$f \circ g = \mathrm{id}_p$ and $g \circ f \sim \mathrm{id}_X$.

EXERCISE 4.1 A map $f : X \to Y$ is null homotopic (or homotopically trivial)
if it is homotopic to a one-point map, namely a map sending all points of the
domain to a single point in the range. Show that $X$ is contractible if and
only if the identity map on $X$ is null homotopic.

EXERCISE 4.2 Let $X$ be any space, and define the cone over $X$, denoted $CX$,
to be the quotient space $(I \times X)/\!\sim$ where $(1, x) \sim (1, y)$ for
all $x, y \in X$.² Intuitively, construct a cylinder over $X$ and then pinch
off one end. (See Figure 4.2.) Show that a cone is always contractible.

² Be sure to distinguish the two different meanings of $\sim$: $f \sim g$ means
that $f$ and $g$ are homotopic whereas $(1, x) \sim (1, y)$ means that the two
points are identified. Sorry, but both usages are pretty standard.

Figure 4.2 A cone over a disk. (This object is solid because the disk is solid.)

Figure 4.3 The suspension of a circle. (This object is hollow because the
circle is hollow.)

Example 4.2 The suspension of a space $X$, denoted $\Sigma X$, is the quotient
space $(I \times X)/\!\sim$ where $(0, x) \sim (0, y)$ and $(1, x) \sim (1, y)$
for all $x, y \in X$. Intuitively, glue two cones together along the base space
$X$. Figure 4.3 depicts the suspension of a circle. Apparently,
$\Sigma S^1 = S^2$. Indeed, one can show that the suspension of $S^n$ is
homeomorphic to $S^{n+1}$.³

If $X$ and $Y$ are homeomorphic then they are homotopic, because we can take
$g = f^{-1}$ in the definition, but the converse is false. One way to see this
is just to note that homeomorphisms preserve open sets whereas a contractible
space, such as an open disk, is homotopic to a point (a closed set). This means
that homotopy is a weaker notion than homeomorphism. This is both good and bad.
It is good because being a weaker notion means that homotopies are often easier
to deal with than homeomorphisms (some of which can be really ugly) and because
homotopy equivalence is a topological invariant. It is bad because homotopy
equivalence is not a complete topological invariant, so knowing which spaces
are homotopy equivalent does not solve the fundamental problem of topology,
which is to determine which spaces are homeomorphic.

³ A formal proof of this fact can be found in [75], p. 334, although a more
elementary proof proceeds from the notion of a join of simplicial complexes.
See Exercise 5.5 at the end of Chapter 5.


4.2 The Poincaré lemma

With that little bit of elementary homotopy theory out of the way, we can now turn 
to the property of Euclidean space alluded to in the introduction. 

Theorem 4.1 (Poincaré lemma) Let $U \subset \mathbb{R}^n$ be contractible. Let
$\omega \in \Omega^{k+1}(U)$ be closed. Then $\omega$ is exact, i.e., there
exists an $\alpha \in \Omega^k(U)$ such that $\omega = d\alpha$.

Proof Consider the cylinder $I \times U$, and let $s_0 : U \to I \times U$ and
$s_1 : U \to I \times U$ be the inclusion maps into the bottom and top of the
cylinder given by $x \mapsto (0, x)$ and $x \mapsto (1, x)$, respectively. As
$U$ is homotopic to a point, there is a homotopy⁴ $F : I \times U \to U$ such
that $F(0, x) = x_0$ and $F(1, x) = x$, which is the same thing as saying

    $$(F \circ s_0)(x) = x_0,$$
    $$(F \circ s_1)(x) = x.$$

Taking pullbacks gives

    $$s_0^* \circ F^* = 0 \quad \text{(the zero map on forms)},$$    (4.6)

    $$s_1^* \circ F^* = 1 \quad \text{(the identity map on forms)}.$$    (4.7)

The theorem will follow if we can show that there exists an operator

    $$h : \Omega^{k+1}(I \times U) \to \Omega^k(U),$$

called a homotopy operator, satisfying

    $$hd + dh = s_1^* - s_0^*.$$    (4.8)

Given such an operator, we may apply (4.8) to the form $F^*\omega$ and use
(4.6) and (4.7) to get

    $$h\, dF^*\omega + d\, hF^*\omega = (s_1^* - s_0^*)F^*\omega = \omega.$$

But $dF^*\omega = F^* d\omega = 0$ because $\omega$ is closed, so we conclude
that

    $$\omega = d(hF^*\omega),$$

i.e., $\omega$ is exact.

⁴ Technically we ought to distinguish between continuous and smooth
homotopies, but fortunately the two notions are essentially equivalent for
smooth manifolds. More precisely, Whitney’s approximation theorem says that
every continuous map of smooth manifolds is continuously homotopic to a smooth
map (see e.g. Proposition 17.8 in [12]). Therefore, we may safely take all our
homotopies to be smooth maps.

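To show that $h$ exists, we construct it. Any form $\eta$ on $I \times U$ is a
sum of terms of the following types:

    type I,  $a(t, x)\, dx^J$;
    type II, $a(t, x)\, dt \wedge dx^J$.

We define the homotopy operator $h$ by setting

    $$h(a(t, x)\, dx^J) = 0,$$

    $$h(a(t, x)\, dt \wedge dx^J) = \left( \int_0^1 a(t, x)\, dt \right) dx^J,$$

and extending by linearity.

On forms $\eta$ of type I we have

    $$d\eta = \frac{\partial a}{\partial t}\, dt \wedge dx^J + \text{terms not containing } dt,$$

so, as $h\eta = 0$ by construction,

    $$\begin{aligned}
    hd\eta + dh\eta &= \left( \int_0^1 \frac{\partial a}{\partial t}\, dt \right) dx^J \\
                    &= [a(1, x) - a(0, x)]\, dx^J \\
                    &= (s_1^* - s_0^*)(a(t, x)\, dx^J);
    \end{aligned}$$

thus (4.8) is satisfied in this case.

On forms $\eta$ of type II we have

    $$hd\eta = h\left( -\sum_i \frac{\partial a}{\partial x^i}\, dt \wedge dx^i \wedge dx^J \right)
             = -\sum_i \left( \int_0^1 \frac{\partial a}{\partial x^i}\, dt \right) dx^i \wedge dx^J$$

and

    $$\begin{aligned}
    dh\eta &= d\left[ \left( \int_0^1 a(t, x)\, dt \right) dx^J \right] \\
           &= \sum_i \frac{\partial}{\partial x^i} \left[ \int_0^1 a(t, x)\, dt \right] dx^i \wedge dx^J \\
           &= \sum_i \left( \int_0^1 \frac{\partial a}{\partial x^i}\, dt \right) dx^i \wedge dx^J,
    \end{aligned}$$

so $hd\eta + dh\eta = 0$. But $s_0^*\eta = s_1^*\eta = 0$ (because both
pullbacks send $dt \mapsto 0$), so again (4.8) is satisfied.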


□

EXERCISE 4.3 Let

    $$\omega = A\, dy \wedge dz + B\, dz \wedge dx + C\, dx \wedge dy$$    (4.9)

be a closed 2-form in $\mathbb{R}^3$, so that $d\omega = 0$. The space
$\mathbb{R}^3$ is contractible via the homotopy

    $$F(t, x, y, z) = (tx, ty, tz),$$    (4.10)

so by the Poincaré lemma we must have $\omega = dhF^*\omega$. We wish to verify
this explicitly.

(a) Show that

    $$\begin{aligned}
    hF^*\omega &= \left( \int_0^1 A(tx, ty, tz)\, t\, dt \right) (y\, dz - z\, dy) \\
               &\quad + \left( \int_0^1 B(tx, ty, tz)\, t\, dt \right) (z\, dx - x\, dz) \\
               &\quad + \left( \int_0^1 C(tx, ty, tz)\, t\, dt \right) (x\, dy - y\, dx).
    \end{aligned}$$

(b) Show that $\omega = dhF^*\omega$. Hint: It suffices to obtain only one term
in $\omega$, say the last, because the others follow similarly by cyclic
permutation. At one point in the calculation you may want to use the fact
(which you should justify) that

    $$t \frac{d}{dt} C(tx, ty, tz) = (\mathbf{x} \cdot \nabla) C(tx, ty, tz).$$

You may also want to integrate by parts somewhere.
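Before tackling the general case it may help to run the machinery on the simplest closed 2-form. The choice $A = B = 0$, $C = 1$ below is purely illustrative; the computation only uses the definition of the homotopy operator $h$ and the homotopy (4.10).

```latex
% Illustrative special case: \omega = dx \wedge dy  (A = B = 0, C = 1).
\begin{align*}
F^*\omega &= d(tx) \wedge d(ty)
           = (t\,dx + x\,dt) \wedge (t\,dy + y\,dt) \\
          &= t^2\, dx \wedge dy + t\, dt \wedge (x\,dy - y\,dx), \\
hF^*\omega &= \left( \int_0^1 t\, dt \right)(x\,dy - y\,dx)
            = \tfrac{1}{2}\,(x\,dy - y\,dx), \\
d(hF^*\omega) &= \tfrac{1}{2}\,(dx \wedge dy - dy \wedge dx)
               = dx \wedge dy = \omega.
\end{align*}
```

Only the term containing $dt$ survives the application of $h$, which is why the type I piece $t^2\, dx \wedge dy$ drops out.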


Remark In Exercise 3.28 you showed that Maxwell’s source-free equations can
be written $dF = 0$, where $F$ is the electromagnetic 2-form. Therefore, by the
Poincaré lemma in $\mathbb{R}^3$, there exists a 1-form $A$ such that
$F = dA$; $A$ is called the vector potential because its dual is the usual
electromagnetic vector potential.


4.3 de Rham cohomology 

Recall that $\Omega^k(M)$ denotes the vector space of all $k$-forms on $M$. It
has two distinguished subspaces: $Z^k(M)$, the space of all closed $k$-forms on
$M$, and $B^k(M)$, the space of all exact $k$-forms on $M$. As we have seen,
all exact forms are closed, so we have the inclusions

    $$B^k(M) \subset Z^k(M) \subset \Omega^k(M).$$    (4.11)

We seek a convenient measure of the extent to which a closed form fails to be
exact. Such a measure is provided by the $k$th de Rham cohomology group⁵
$H^k_{\mathrm{dR}}(M)$, which is the quotient space of the closed $k$-forms
modulo the exact $k$-forms:

    $$H^k_{\mathrm{dR}}(M) = Z^k(M)/B^k(M).$$    (4.12)

Put another way, two closed forms are cohomologous (equivalent) if they differ
by an exact form, and $H^k_{\mathrm{dR}}(M)$ is the set of all cohomology
classes (equivalence classes). The remarkable thing about the cohomology groups
of a manifold is that they are topological invariants, so that two manifolds
with the same topology have the same cohomology groups. One simple illustration
of this is provided next.

Let $X$ be a space. If there do not exist two disjoint nonempty open sets $U$
and $V$ whose union is $X$ then $X$ is connected. Two points $x$ and $y$ are in
the same connected component of $X$ if there is a connected subset of $X$
containing both $x$ and $y$. The number of connected components of a space is
probably the simplest and most important topological invariant of a space.

Theorem 4.2 Let $M$ be a smooth manifold. Then

    $$\dim H^0_{\mathrm{dR}}(M) = \text{the number of connected components of } M.$$    (4.13)

Proof No 0-form $f$ on $M$ is exact (there are no $(-1)$-forms!), so every
closed 0-form defines its own cohomology class. But $df = 0$ if and only if $f$
is constant on each connected component $M_i$ of $M$, and a basis for the
vector space of constant functions on the connected components is provided by
the unit functions $\{1_{M_i}\}$. □

A slight generalization of the proof of Theorem 4.1 yields the following 
important result. 

Theorem 4.3 If $f, g : M \to N$ are homotopic maps then
$f^* = g^* : H^\bullet_{\mathrm{dR}}(N) \to H^\bullet_{\mathrm{dR}}(M)$.⁶ In
other words, homotopic maps induce the same map in cohomology.

⁵ If the word “group” bothers you, you can safely replace it here with the
phrase “vector space,” because the de Rham cohomology group
$H^k_{\mathrm{dR}}(M)$ (over the reals) is always a vector space. In more
general cohomology theories one usually works with coefficients chosen from an
abelian group, in which case one obtains honest cohomology groups. For our
purposes here, which is simply to illustrate the connection between
differential forms and topology, the de Rham cohomology is adequate. But the
reader should be aware that other cohomology theories exist that sometimes
provide more detailed topological information about manifolds. (Specifically,
they provide information about something called torsion.)

⁶ We follow the standard if somewhat cavalier notation in which
$H^\bullet_{\mathrm{dR}}(M)$ stands for $\bigoplus_k H^k_{\mathrm{dR}}(M)$ and
all maps respect the grading, so that cohomology in one degree is mapped to
cohomology in the same degree.

Proof If $f, g : M \to N$ are homotopic then by (4.4) and (4.5) there is a
homotopy $F : I \times M \to N$ such that

    $$F \circ s_0 = f,$$
    $$F \circ s_1 = g,$$

where $s_0, s_1 : M \to I \times M$ are the inclusion maps into the bottom and
top of the cylinder $I \times M$. Hence

    $$f^* = s_0^* \circ F^*,$$
    $$g^* = s_1^* \circ F^*.$$

The proof of the existence of the homotopy operator in the Poincaré lemma
carries over patch by patch from $\mathbb{R}^n$ to $M$, allowing us to conclude
that there is a homotopy operator
$h : \Omega^{k+1}(I \times M) \to \Omega^k(M)$ satisfying

    $$hd + dh = s_1^* - s_0^*.$$    (4.14)

But the operator on the left of (4.14) maps closed forms to exact forms, which
means that it is the zero operator on cohomology. Thus

    $$g^* - f^* = (s_1^* - s_0^*)F^*$$    (4.15)

is also the zero operator on cohomology. □

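Corollary 4.4 (Homotopy invariance of de Rham cohomology) If $M$ and $N$ are
homotopic then $H^\bullet_{\mathrm{dR}}(M) = H^\bullet_{\mathrm{dR}}(N)$.

Proof If $M$ and $N$ are homotopic then there exist $f : M \to N$ and
$g : N \to M$ such that $f \circ g \sim \mathrm{id}_N$ and
$g \circ f \sim \mathrm{id}_M$. Hence, by Theorem 4.3, $g^* \circ f^*$ equals
the identity on $H^\bullet_{\mathrm{dR}}(N)$ and $f^* \circ g^*$ equals the
identity on $H^\bullet_{\mathrm{dR}}(M)$. In other words, $f^*$ and $g^*$ are
isomorphisms and $H^\bullet_{\mathrm{dR}}(M) = H^\bullet_{\mathrm{dR}}(N)$. □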

Homeomorphic manifolds are homotopic, so it follows that the de Rham coho- 
mology group of a manifold is a topological invariant. But homotopic manifolds 
are not necessarily homeomorphic, so it is not clear at this point whether the de 
Rham cohomology of a manifold is a complete topological invariant. In fact, it is 
not; there are many distinct manifolds with the same cohomology groups. Still, the 
nice thing about cohomology groups is that there are many techniques with which 
to compute them and if two manifolds have different cohomology groups then you 
know that they are topologically distinct.

Example 4.3 The space $\mathbb{R}^n$ is contractible, so it has the homotopy
type of a point. It follows from Theorem 4.2 and Corollary 4.4 that

    $$H^k_{\mathrm{dR}}(\mathbb{R}^n) =
      \begin{cases} \mathbb{R}, & \text{if } k = 0, \text{ and} \\ 0 & \text{otherwise.} \end{cases}$$    (4.16)


Example 4.4 A retraction $r : X \to Y$ of a space $X$ onto a subspace $Y$ is a
continuous map that restricts to the identity on $Y$. In symbols,
$r \circ i = \mathrm{id}_Y$, where $i : Y \to X$ is the inclusion map. If we
also demand that $i \circ r \sim \mathrm{id}_X$ then $r$ is called a
deformation retraction of $X$ onto $Y$. A deformation retraction is a
well-behaved sort of retraction, which collapses $X$ onto $Y$ in a way that can
be inverted up to homotopy. If $r$ is a deformation retraction then $r$ and $i$
are homotopy inverses, and Theorem 4.3 tells us that
$r^* : H^\bullet_{\mathrm{dR}}(Y) \to H^\bullet_{\mathrm{dR}}(X)$ is an
isomorphism.

To illustrate this, consider the punctured Euclidean space
$X := \mathbb{R}^{n+1} - \{0\}$. The sphere $S^n$ is a deformation retract of
$X$ via the map $r : X \to S^n$ given by $x \mapsto x/\|x\|$. To see this, let
$\iota : S^n \to X$ be the inclusion map. Then
$r \circ \iota = \mathrm{id}_{S^n}$ and $\iota \circ r \sim \mathrm{id}_X$, via
the homotopy

    $$F(t, x) = (1 - t)x + t\, x/\|x\|,$$

say. Therefore, by Corollary 4.4,
$r^* : H^\bullet_{\mathrm{dR}}(S^n) \to H^\bullet_{\mathrm{dR}}(X)$ is an
isomorphism. (In Exercise 4.7 you can compute the cohomology of the spheres.)
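The key point in the punctured-space example is that the straight-line homotopy $F(t, x) = (1 - t)x + t\, x/\|x\|$ never passes through the deleted origin. The sketch below checks the endpoints and samples the norm of $F(t, x)$ along the way for one arbitrarily chosen point of $\mathbb{R}^3 - \{0\}$; the point and the helper names are illustrative assumptions.

```python
# Sketch: the homotopy F(t, x) = (1 - t) x + t x/||x|| on R^3 minus the origin.
# The sample point and helper names are illustrative assumptions.
import math

def norm(x):
    return math.sqrt(sum(c * c for c in x))

def retract(x):
    """r(x) = x / ||x||, defined for x != 0."""
    n = norm(x)
    return [c / n for c in x]

def homotopy(t, x):
    """F(t, x) = (1 - t) x + t x / ||x||."""
    r = retract(x)
    return [(1 - t) * c + t * rc for c, rc in zip(x, r)]

x = [3.0, -4.0, 12.0]                      # ||x|| = 13
start = homotopy(0.0, x)                   # F(0, x) = x
end = homotopy(1.0, x)                     # F(1, x) = x / ||x||, a point of S^2
min_norm = min(norm(homotopy(k / 100, x)) for k in range(101))

print(start, norm(end), min_norm)
```

Since $F(t, x)$ is a positive multiple of $x$ for every $t$ in $[0, 1]$, its norm interpolates linearly between $\|x\|$ and 1 and so stays away from zero.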


Remark Observe that $H^k_{\mathrm{dR}}(M) = 0$ if $k > \dim M$.


4.4 Diagram chasing* 

Over the years a large number of techniques have been developed to compute var- 
ious cohomology groups, of which the de Rham cohomology groups are arguably 
the most important. In this section and the next we describe some of the powerful 
machinery behind these techniques. 

Let $C = \bigoplus_{i \in \mathbb{Z}} C^i$ be a direct sum of vector spaces. It
is called a differential complex if there exist linear maps
$d_i : C^i \to C^{i+1}$ such that $d_{i+1} d_i = 0$. The map $d_i$ is often
called the differential of the complex, even though it may have nothing to do
with differentiation. We represent a differential complex pictorially by a
sequence

    $$\cdots \longrightarrow C^{i-1} \xrightarrow{\ d_{i-1}\ } C^{i} \xrightarrow{\ d_i\ } C^{i+1} \longrightarrow \cdots$$    (4.17)

with the property that $\operatorname{im} d_i \subset \ker d_{i+1}$. The
cohomology of the complex $C$ is the direct sum
$H(C) = \bigoplus_{i \in \mathbb{Z}} H^i(C)$, where $Z^i(C) := \ker d_i$,
$B^i(C) := \operatorname{im} d_{i-1}$, and $H^i(C) = Z^i(C)/B^i(C)$. The
elements of $Z^i(C)$ are called $i$-cocycles and the elements of $B^i(C)$ are
called $i$-coboundaries (for reasons that will become clear in Chapter 5). In
general the sequence (4.17) is not exact, and the degree to which it fails to
be exact is measured precisely by its cohomology.
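For a complex of finite-dimensional vector spaces the definitions reduce to linear algebra: $\dim H^i(C) = \dim \ker d_i - \operatorname{rank} d_{i-1} = \dim C^i - \operatorname{rank} d_i - \operatorname{rank} d_{i-1}$. The following sketch computes these dimensions by Gaussian elimination over the rationals. The function names and the sample complex are illustrative assumptions: the example takes $C^0$ to be functions on the three vertices of a triangle and $C^1$ functions on its three oriented edges, with $d_0$ sending a function to its differences along the edges.

```python
# Sketch: cohomology dimensions of a finite-dimensional differential complex.
# Function names and the sample complex are illustrative assumptions.
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix (list of rows) by Gauss-Jordan elimination over Q."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for col in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][col] != 0:
                factor = rows[i][col] / rows[r][col]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def cohomology_dims(dims, differentials):
    """Return [dim H^0, dim H^1, ...].

    dims[i] is dim C^i; differentials[i] is the matrix of d_i : C^i -> C^{i+1},
    with dims[i+1] rows and dims[i] columns. Spaces outside the list are zero.
    """
    ranks = [rank(d) for d in differentials]
    result = []
    for i, n in enumerate(dims):
        rank_out = ranks[i] if i < len(ranks) else 0     # rank of d_i
        rank_in = ranks[i - 1] if i > 0 else 0           # rank of d_{i-1}
        result.append(n - rank_out - rank_in)
    return result

# Triangle with vertices 0, 1, 2 and oriented edges 0->1, 1->2, 2->0:
# (d_0 f)(edge v->w) = f(w) - f(v).
d0 = [
    [-1, 1, 0],
    [0, -1, 1],
    [1, 0, -1],
]
triangle = cohomology_dims([3, 3], [d0])

# A two-term complex with an invertible differential has no cohomology at all.
exact_example = cohomology_dims([1, 1], [[[1]]])

print(triangle, exact_example)
```

In the triangle example the kernel of $d_0$ consists of the constant functions, while its image has dimension two inside the three-dimensional $C^1$, so one class survives in each degree.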

The de Rham complex is just $\Omega^\bullet(M) = \bigoplus_k \Omega^k(M)$
equipped with the exterior derivative operator, and the de Rham cohomology is
just the cohomology of this complex. The cocycles of the de Rham complex are
the closed forms and the coboundaries are the exact forms. For this reason,
even for an arbitrary complex, we sometimes refer to a cocycle as being closed
and a coboundary as being exact.

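Suppose that we have two complexes, $(A, d_A)$ and $(B, d_B)$.⁷ A chain map
$\varphi : (A, d_A) \to (B, d_B)$ is a collection of homomorphisms
$\varphi_i : A^i \to B^i$ that commute with the differential operators:
$\varphi_{i+1} d_{A,i} = d_{B,i} \varphi_i$. A pair of complexes related in
this way by a chain map is called a homomorphism of complexes and is elegantly
represented by the following commutative diagram.⁸

    $$\begin{array}{ccc}
    A^{i+1} & \xrightarrow{\ \varphi_{i+1}\ } & B^{i+1} \\
    {\scriptstyle d_i} \uparrow & & \uparrow {\scriptstyle d_i} \\
    A^{i} & \xrightarrow{\ \varphi_{i}\ } & B^{i} \\
    {\scriptstyle d_{i-1}} \uparrow & & \uparrow {\scriptstyle d_{i-1}} \\
    A^{i-1} & \xrightarrow{\ \varphi_{i-1}\ } & B^{i-1}
    \end{array}$$    (4.18)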

where here and henceforth we suppress the complex labels on the differential oper- 
ators to avoid cluttering the notation. This diagrammatic notation means that we 
get the same result no matter which way we choose to follow the arrows. The dia- 
gram is called commutative because the path independence condition is precisely 
equivalent to the statement that <p commutes with d. 

EXERCISE 4.4 Show that a homomorphism of complexes gives rise to a
homomorphism of cohomology groups. That is, show that there is a natural linear
map $h_i : H^i(A) \to H^i(B)$ for all $i$. Hint: Let $a \in Z^i(A)$ be closed
and denote its equivalence class in $H^i(A)$ by $[a]$. Define
$h_i([a]) = [\varphi_i(a)]$. Show that the map is well defined, i.e., that it
maps cohomology classes to cohomology classes and is independent of class
representative.

⁷ Warning: The quantity $B$ is now a complex, so it consists of vector spaces
$B^i$. Do not conflate the vector space $B^i$ with $B^i(B)$, the vector
subspace of $i$-coboundaries of $B^i$. To ensure that we do not fall into this
trap, we will write $\operatorname{im} d_{B,i-1}$ rather than $B^i(B)$ whenever
there is a danger of confusion.

⁸ For the sake of future applications it is convenient to draw the
differentials vertically and the chain maps horizontally.


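Suppose now that we have three complexes, $A$, $B$, and $C$, together with
chain maps $\varphi : A \to B$ and $\psi : B \to C$. Furthermore, suppose that

    $$0 \longrightarrow A^i \xrightarrow{\ \varphi_i\ } B^i \xrightarrow{\ \psi_i\ } C^i \longrightarrow 0$$

is a short exact sequence for each $i$. Then we have what is called a short
exact sequence of complexes:

    $$\begin{array}{ccccccccc}
    0 & \longrightarrow & A^{i+1} & \xrightarrow{\ \varphi_{i+1}\ } & B^{i+1} & \xrightarrow{\ \psi_{i+1}\ } & C^{i+1} & \longrightarrow & 0 \\
      & & \uparrow {\scriptstyle d_i} & & \uparrow {\scriptstyle d_i} & & \uparrow {\scriptstyle d_i} & & \\
    0 & \longrightarrow & A^{i} & \xrightarrow{\ \varphi_{i}\ } & B^{i} & \xrightarrow{\ \psi_{i}\ } & C^{i} & \longrightarrow & 0 \\
      & & \uparrow {\scriptstyle d_{i-1}} & & \uparrow {\scriptstyle d_{i-1}} & & \uparrow {\scriptstyle d_{i-1}} & & \\
    0 & \longrightarrow & A^{i-1} & \xrightarrow{\ \varphi_{i-1}\ } & B^{i-1} & \xrightarrow{\ \psi_{i-1}\ } & C^{i-1} & \longrightarrow & 0
    \end{array}$$    (4.19)

A short exact sequence of complexes gives rise to a long exact sequence in
cohomology. More precisely, we have

    $$\cdots \xrightarrow{\ \delta_{i-1}\ } H^{i}(A) \longrightarrow H^{i}(B) \longrightarrow H^{i}(C)
      \xrightarrow{\ \delta_{i}\ } H^{i+1}(A) \longrightarrow H^{i+1}(B) \longrightarrow H^{i+1}(C)
      \xrightarrow{\ \delta_{i+1}\ } \cdots$$    (4.20)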

The maps labeled $\delta_i$ in (4.20) are called coboundary operators (or
connecting homomorphisms) and they are somewhat tricky to define. Let
$[c] \in H^i(C)$. We want to define its image in $H^{i+1}(A)$ under the map
$\delta_i$. Consider (4.19) again. Choose any $c \in [c]$. By the surjectivity
of $\psi_i$, there exists an element $b \in B^i$ such that $\psi_i b = c$. But,
by commutativity, $\psi_{i+1} d_i b = d_i \psi_i b = d_i c = 0$. Hence, by the
exactness of the top row, there exists an $a \in A^{i+1}$ such that
$\varphi_{i+1} a = d_i b$. It turns out that $a$ is closed, and we define
$\delta_i [c]$ to be the cohomology class $[a]$ in $H^{i+1}(A)$.

We need to show two things: first that $a$ is closed and second that the
coboundary operator is well defined, in the sense that it does not depend on
any of the arbitrary choices we have made at each step in the process of going
from $[c]$ to $[a]$. For the first, note that
$\varphi_{i+2} d_{i+1} a = d_{i+1} \varphi_{i+1} a = d_{i+1} d_i b = 0$, so, by
the injectivity of $\varphi_{i+2}$, $d_{i+1} a = 0$. For the second, well, it’s
a useful exercise to show this.

EXERCISE 4.5 Show that the coboundary operator Sj is well defined. Remark: After 
completing this exercise you will appreciate why it is called “diagram chasing”. 

EXERCISE 4.6 Show that the sequence (4.20) is exact. Hint: Label the maps 
Hj(A) —*■ Hj(B ) and Hj(B ) — > //, (C) by a; and /f respectively, so that a,- [a] = 
and [b] = [i//,/?]. To show exactness at //, (4), for example, show separately 
that i m <5; _ i C ker a, and kem, C im<$,_] . Do something similar at Hj(B ) and 
Hj(C). Remark: If you find yourself lost amongst the arrows, you can consult e.g. 
[37], pp. 113ff. (Hatcher is working in homology instead of cohomology, but the 
approach is the same.) 


4.5 The Mayer-Vietoris sequence*

The Mayer-Vietoris sequence is an important piece of technology that allows one 
to compute the de Rham cohomology of a space in terms of its pieces. Like much 
of modern technology, however, its inner workings are somewhat complicated. 
Nevertheless, it is worth discussing as an illustration of how to perform simple 
computations in cohomology. 9 

Suppose that $M$ is the union of two open sets $U$ and $V$. Then there is a natural
sequence of maps

$$M \xleftarrow{\ \varphi\ } U \amalg V \overset{\iota_U}{\underset{\iota_V}{\leftleftarrows}} U \cap V,$$

where $\amalg$ denotes a disjoint union and $\iota_U$ and $\iota_V$ are the inclusion maps of $U \cap V$
into $U$ and $V$, respectively. The map $\varphi$ is just the inclusion map of each individual
subset $U$ and $V$ into $M$. Hence, by the properties of the pullback map, we get a
sequence of maps of differential forms,

$$\Omega^*(M) \xrightarrow{\ \varphi^*\ } \Omega^*(U) \oplus \Omega^*(V) \overset{\iota_U^*}{\underset{\iota_V^*}{\rightrightarrows}} \Omega^*(U \cap V).$$

Combining the last two maps into a single map by taking their difference, we get
an exact sequence.

9 It may also be good for your health. Leopold Vietoris (1891-2002) lived to the age of 110! (By contrast, Walter
Mayer (1887-1948) was only 60 at the time of his death.)

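Lemma 4.5 The sequence

$$0 \longrightarrow \Omega^*(M) \xrightarrow{\ \varphi^*\ } \Omega^*(U) \oplus \Omega^*(V) \xrightarrow{\ \psi^*\ } \Omega^*(U \cap V) \longrightarrow 0,$$

where $\psi^*(\mu, \nu) := \nu - \mu$, is exact. It is called the (short) Mayer-Vietoris
sequence.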


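Proof To show exactness at the first term, we must show that $\varphi^*$ is injective.
But this is clear, because if a form vanishes on both $U$ and $V$ then it must
vanish on $U \cup V = M$, so the only thing sent to zero by $\varphi^*$ is the zero form. Next,
suppose we have a form $\eta$ on $M$. Then it maps to $(\eta, \eta)$ under $\varphi^*$ (because it is a
global form on $M$ and so is the same on $U$ or $V$), whereupon it is killed by $\psi^*$.
Conversely, if $\psi^*(\mu, \nu) = 0$ then $\mu$ and $\nu$ agree on $U \cap V$, so they patch together
into a global form on $M$ whose image under $\varphi^*$ is $(\mu, \nu)$.
Hence the sequence is exact at the middle term.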

To show exactness at the last term, we must show that $\psi^*$ is surjective. That is,
given a form $\omega \in \Omega^*(U \cap V)$, we must show there exist forms $\mu \in \Omega^*(U)$ and
$\nu \in \Omega^*(V)$ such that $\omega = \nu - \mu$. To do this, we assume for the moment that there
exist smooth functions $\lambda_U$ and $\lambda_V$ that have very special properties: (i) $\lambda_U, \lambda_V \ge 0$,
(ii) $\lambda_U + \lambda_V = 1$, and (iii) $\operatorname{supp}(\lambda_U) \subset U$ and $\operatorname{supp}(\lambda_V) \subset V$, where $\operatorname{supp}(f)$ is
the support of $f$, namely the smallest closed set containing all the points where $f$
is nonzero. The pair of functions $\{\lambda_U, \lambda_V\}$ is called a partition of unity subordinate
to the open cover $\{U, V\}$. 10 Given such a partition, define

$$
\mu := \begin{cases} -\lambda_V \omega & \text{on } U \cap V, \\ 0 & \text{otherwise,} \end{cases}
\qquad \text{and} \qquad
\nu := \begin{cases} \lambda_U \omega & \text{on } U \cap V, \\ 0 & \text{otherwise.} \end{cases}
$$

Then $\mu$ is a smooth form on $U$ and $\nu$ is a smooth form on $V$, and $\nu - \mu = \lambda_U \omega +
\lambda_V \omega = \omega$, as desired. 11


10 In general, a partition of unity on a smooth manifold $M$ is an at most countable collection of nonnegative
smooth functions $\{\lambda_i\}$ such that (1) $\{\operatorname{supp} \lambda_i\}$ is locally finite, and (2) $\sum_i \lambda_i(p) = 1$ for all $p \in M$. If
$\{\operatorname{supp} \lambda_i\}$ is a refinement of an open cover of $M$ we say that the partition of unity is subordinate to the open
cover.

11 At first sight it may seem a bit strange that, to define a form on $U$, we multiply $\omega$ by $\lambda_V$ instead of
$\lambda_U$. The reason why we do this is rather subtle, and is best illustrated by a picture (Figure 4.4). Consider
the special case in which $M$ is one dimensional and $\omega = f$, a smooth function. For simplicity, suppose
that $f$ is the constant function 1 on $U \cap V$, as shown in Figure 4.4(a). By the properties of a partition of

Figure 4.4 (a) A partition of unity subordinate to the cover $\{U, V\}$. (b) Weighted
functions.


The only task remaining is to prove the existence of a partition of unity with the 
desired properties, and for this we refer the reader to [88], Theorem 1.11, or [51], 
Theorem 2.25. □ 
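For readers who like to see such things concretely, here is a minimal numerical sketch (in Python) of a partition of unity in the simplest one-dimensional situation. Everything specific in it is an illustrative assumption rather than something taken from the references just cited: the cover $U = (-\infty, 1)$, $V = (-1, \infty)$ of the real line, the transition interval $[-1/2, 1/2]$, the choice $\omega = 1$ on the overlap, and all the function names. The code merely evaluates $\lambda_U$, $\lambda_V$, $\mu$, and $\nu$ at a few sample points and measures how far $\nu - \mu$ is from $\omega$ there.

```python
import math

def smooth_cutoff(t):
    """exp(-1/t) for t > 0 and 0 otherwise; this function is smooth on all of R."""
    return math.exp(-1.0 / t) if t > 0 else 0.0

def smooth_step(t):
    """A smooth function equal to 0 for t <= 0 and to 1 for t >= 1."""
    a, b = smooth_cutoff(t), smooth_cutoff(1.0 - t)
    return a / (a + b)  # the denominator never vanishes

# Assumed cover of the real line: U = (-inf, 1) and V = (-1, inf).
def lam_V(x):
    """Vanishes for x <= -1/2, so its support [-1/2, inf) lies inside V."""
    return smooth_step(x + 0.5)

def lam_U(x):
    """Vanishes for x >= 1/2, so its support (-inf, 1/2] lies inside U."""
    return 1.0 - lam_V(x)

def omega(x):
    """The form to be split: the constant function 1 on the overlap (-1, 1)."""
    return 1.0

def mu(x):
    """-lam_V * omega on the overlap, extended by zero to the rest of U."""
    return -lam_V(x) * omega(x) if -1.0 < x < 1.0 else 0.0

def nu(x):
    """lam_U * omega on the overlap, extended by zero to the rest of V."""
    return lam_U(x) * omega(x) if -1.0 < x < 1.0 else 0.0

# Largest deviation of nu - mu from omega over some points of the overlap.
samples = [-0.9, -0.5, -0.25, 0.0, 0.25, 0.5, 0.9]
max_error = max(abs(nu(x) - mu(x) - omega(x)) for x in samples)
print(max_error)
```

In this sketch $\mu$ already vanishes identically near the end $x = -1$ of the overlap, which is why extending it by zero to the rest of $U$ does no harm to smoothness; had we multiplied by $\lambda_U$ instead, the extension by zero would jump there.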

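There is one short exact Mayer-Vietoris sequence in each degree, and the
sequences are all related by the exterior derivative. Putting them all together gives
a short exact sequence of complexes:

$$
\begin{array}{ccccccccc}
  &     & \vdots            &                         & \vdots                                   &                       & \vdots                 &     &   \\
  &     & \uparrow {\scriptstyle d} &                 & \uparrow {\scriptstyle d}                &                       & \uparrow {\scriptstyle d} & &   \\
0 & \to & \Omega^{i+1}(M)   & \xrightarrow{\varphi^*} & \Omega^{i+1}(U) \oplus \Omega^{i+1}(V)   & \xrightarrow{\psi^*}  & \Omega^{i+1}(U \cap V) & \to & 0 \\
  &     & \uparrow {\scriptstyle d} &                 & \uparrow {\scriptstyle d}                &                       & \uparrow {\scriptstyle d} & &   \\
0 & \to & \Omega^{i}(M)     & \xrightarrow{\varphi^*} & \Omega^{i}(U) \oplus \Omega^{i}(V)       & \xrightarrow{\psi^*}  & \Omega^{i}(U \cap V)   & \to & 0 \\
  &     & \uparrow {\scriptstyle d} &                 & \uparrow {\scriptstyle d}                &                       & \uparrow {\scriptstyle d} & &   \\
0 & \to & \Omega^{i-1}(M)   & \xrightarrow{\varphi^*} & \Omega^{i-1}(U) \oplus \Omega^{i-1}(V)   & \xrightarrow{\psi^*}  & \Omega^{i-1}(U \cap V) & \to & 0 \\
  &     & \uparrow {\scriptstyle d} &                 & \uparrow {\scriptstyle d}                &                       & \uparrow {\scriptstyle d} & &   \\
  &     & \vdots            &                         & \vdots                                   &                       & \vdots                 &     &
\end{array}
$$

(The diagram is commutative because the exterior derivative commutes with pullbacks.)
We therefore obtain a long exact sequence in cohomology, called the (long)
Mayer-Vietoris sequence:

$$\cdots \longrightarrow H^i(M) \xrightarrow{\ \varphi^*\ } H^i(U) \oplus H^i(V) \xrightarrow{\ \psi^*\ } H^i(U \cap V) \xrightarrow{\ \delta_i\ } H^{i+1}(M) \longrightarrow \cdots \tag{4.21}$$

11 (continued) unity, the functions $\lambda_U$ and $\lambda_V$ must take the form shown. But then the product functions $\lambda_U f$ and $\lambda_V f$
would take the shapes displayed in Figure 4.4(b). Clearly, $\lambda_U f$ is smooth on $V$ but not $U$, and vice versa
for $\lambda_V f$.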


Remark It is sometimes useful to have an explicit expression for the coboundary
map $\delta_i$. To this end, let $\omega \in \Omega^i(U \cap V)$ be a closed form. As $\psi^*$ is surjective, there
exists an $\eta \in \Omega^i(U) \oplus \Omega^i(V)$ such that $\psi^*\eta = \omega$, namely $\eta = (-\lambda_V \omega, \lambda_U \omega)$.
By commutativity $\psi^* d\eta = d\psi^*\eta = d\omega = 0$, so by exactness there exists a
$\xi \in \Omega^{i+1}(M)$ such that $\varphi^*\xi = d\eta$. Again by commutativity, $\varphi^* d\xi = d\varphi^*\xi =
d^2\eta = 0$ so, by the injectivity of $\varphi^*$, $\xi$ is closed and hence it defines a cohomology
class in $H^{i+1}(M)$. We therefore choose $\delta_i([\omega]) = [\xi]$. According to the result in
Exercise 4.5, this map is well defined. Now, $\varphi$ is the inclusion map of $U$ and $V$ into
$M$, so $\varphi^*$ is just the restriction map of $M$ onto $U$ or $V$. In particular, $\varphi^*\xi = d\eta =
(d(-\lambda_V \omega), d(\lambda_U \omega))$ implies that $\xi$ restricted to $U$ is $-d(\lambda_V \omega)$ and $\xi$ restricted
to $V$ is $d(\lambda_U \omega)$. This is consistent, because $\psi^* d\eta = 0$ implies that $-d(\lambda_V \omega) =
d(\lambda_U \omega)$ on the overlap $U \cap V$. Thus $\xi$ is globally defined and

$$
\delta_i[\omega] = \begin{cases} [-d(\lambda_V \omega)] & \text{on } U, \\ [d(\lambda_U \omega)] & \text{on } V. \end{cases}
\tag{4.22}
$$


Example 4.5 As a simple example of the Mayer-Vietoris machine in action, we
will compute the de Rham cohomology of the circle $S^1$. We cover the circle with
two open sets $U$ and $V$, corresponding to the upper and lower “hemispheres”,
respectively, as shown in Figure 4.5(a); then $U \cap V$ consists of two pieces, each
homeomorphic to $\mathbb{R}$, as shown in Figure 4.5(b). As $U$ and $V$ are each homeomorphic
to $\mathbb{R}$ as well, the Mayer-Vietoris sequence becomes 12

$$0 \longrightarrow H^0_{dR}(S^1) \xrightarrow{\ \varphi^*\ } \mathbb{R} \oplus \mathbb{R} \xrightarrow{\ \psi^*\ } \mathbb{R} \oplus \mathbb{R} \xrightarrow{\ \delta_0\ } H^1_{dR}(S^1) \longrightarrow 0. \tag{4.23}$$

Figure 4.5 A covering of the circle. (a) The open sets $U$ and $V$. (b) The overlap $U \cap V$.

By Theorem 4.2 the dimension of the zeroth cohomology of a manifold is just the number of
connected components, and the cohomology generators are the constant functions on
each piece. The circle is connected, so $H^0_{dR}(S^1) = \mathbb{R}$. Next, let $\mu$ be a constant function
on $U$ and $\nu$ be a constant function on $V$. Then $\psi^*(\mu, \nu) = (\nu - \mu, \nu - \mu)|_{U \cap V}$,
a pair of identical constant functions on the pieces of $U \cap V$, so the rank and nullity
of $\psi^*$ are both 1. (For example, if $1_U$ is the function that is 1 on $U$ and $1_V$ is the
function that is 1 on $V$ then a basis for the kernel is $(1_U, 1_V)$, because $1_U = 1_V$ on
the overlap.) By exactness $\dim \ker \delta_0 = \operatorname{rk} \psi^* = 1$, so as $\dim \ker \delta_0 + \operatorname{rk} \delta_0 = 2$
we have $\operatorname{rk} \delta_0 = 1$. But $H^1_{dR}(S^1)$ maps to zero, so again by exactness it must be the
entire image of $\delta_0$, namely $\mathbb{R}$. In other words, $H^1_{dR}(S^1) = \mathbb{R}$. Summarizing, we have

$$
H^k_{dR}(S^1) = \begin{cases} \mathbb{R}, & \text{if } k = 0, 1, \text{ and} \\ 0 & \text{otherwise.} \end{cases}
\tag{4.24}
$$

As this differs from the cohomology of $\mathbb{R}$, it confirms the obvious fact that the circle
and the line are topologically distinct.
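The bookkeeping in this example is just linear algebra, and it can be checked mechanically. The following Python sketch is only an illustration under stated assumptions: we represent $\psi^*$ on zeroth cohomology by its matrix with respect to the bases of constant functions, and `rank` is a hypothetical helper (a small Gaussian elimination over the rationals) written here for the purpose, not a routine from any library.

```python
from fractions import Fraction

def rank(matrix):
    """Rank of a matrix (a list of rows), by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

# psi^* sends the constants (mu, nu) on (U, V) to (nu - mu, nu - mu)
# on the two pieces of the overlap; the columns correspond to mu and nu.
psi_star = [[-1, 1],
            [-1, 1]]

rk = rank(psi_star)
nullity = 2 - rk   # dimension of the kernel of psi^*

dim_H0 = nullity   # phi^* is injective and its image is the kernel of psi^*
dim_H1 = 2 - rk    # H^1 is the image of delta_0, i.e. H^0 of the overlap modulo im psi^*
print(dim_H0, dim_H1)
```

Both dimensions come out equal to 1, in agreement with (4.24). The same two lines of arithmetic (nullity on the left, codimension of the image on the right) are what one repeats, degree by degree, in larger Mayer-Vietoris computations.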

A basis element of the vector space $H^k_{dR}(M)$ is called a generator of the cohomology
of $M$ in degree $k$. There are many different types of cohomology generators,
but typically one chooses a generator of the top-degree cohomology of $M$ to be a
bump form, which is a form with compact support and finite integral. To illustrate
the construction in the case of $S^1$, let $(\lambda_U, \lambda_V)$ be a partition of unity subordinate
to the cover $\{U, V\}$ and let $f$ be a function that is 1 on one component of $U \cap V$,
say, the left-hand component in Figure 4.5(b), and 0 on the other. Then, by (4.22),
$\delta_0 f$ (a global 1-form on $S^1$) generates the cohomology of $S^1$ in degree 1. The 1-form
$\delta_0 f$ equals $-d(\lambda_V f)$ on $U$ and $d(\lambda_U f)$ on $V$, and is a bump form with support in
the left-hand component of $U \cap V$, as shown in Figure 4.6. (This 1-form clearly has
compact support, because its support (a closed set) lies in $U \cap V$ (a bounded set).
Also, its integral is finite because $\lambda_U f$ and $\lambda_V f$ are bounded functions.) 13

Figure 4.6 A bump form on $S^1$.

12 In the Mayer-Vietoris sequence we write $\mathbb{R} \oplus \mathbb{R}$ for $H^0_{dR}(U \cap V)$ to emphasize that there is one cohomology
generator from each piece of the overlap. But we could just as well have written $\mathbb{R}^2$ instead. When there are
many generators people usually drop the unwieldy direct sum notation.


EXERCISE 4.7 Let $S^n$ be the $n$-sphere, namely $\{x \in \mathbb{R}^{n+1} : \|x\| = 1\}$, where
$\|x\| = (\sum_i (x^i)^2)^{1/2}$ is the standard Euclidean norm. By analogy with the previous
example, use the Mayer-Vietoris sequence to show that

$$
H^k_{dR}(S^n) = \begin{cases} \mathbb{R} & \text{if } k = 0 \text{ or } k = n, \text{ and} \\ 0 & \text{otherwise.} \end{cases}
\tag{4.25}
$$

In particular, the $n$-sphere is not contractible, a result that is easy to see intuitively
but difficult to prove directly. Hint: Cover $S^n$ with two patches, and argue that the
overlap is homotopic to a familiar space. Remark: As in the previous example we
can choose a generator of $H^n_{dR}(S^n)$ to be a bump form, although it is not necessary
to know this in order to compute the cohomology.


Example 4.6 We can use the result of the last exercise to prove a famous theorem
in topology called the Brouwer fixed point theorem. Let $B^n$ denote the $n$-ball,
namely the set $\{x \in \mathbb{R}^n : \|x\| \le 1\}$. The Brouwer fixed point theorem asserts that
any continuous map $f : B^n \to B^n$ has a fixed point, namely a point $x$ such that
$f(x) = x$. This theorem has a large number of applications in many diverse fields,
such as game theory, economics, and differential equations. 14

We first show by contradiction that there is no retraction $r : B^{n+1} \to S^n$ of the
$(n+1)$-ball onto its boundary. Suppose that there were. Then, by pulling back everything,
we would have the composition

$$H^n_{dR}(S^n) \xrightarrow{\ r^*\ } H^n_{dR}(B^{n+1}) \xrightarrow{\ \iota^*\ } H^n_{dR}(S^n), \tag{4.26}$$

where $\iota : S^n \to B^{n+1}$ is the inclusion and $\iota^* \circ r^* = 1$. But the ball is contractible, so $H^n_{dR}(B^{n+1})$ vanishes whereas
$H^n_{dR}(S^n)$ does not, a contradiction.

Now suppose that $f : B^n \to B^n$ has no fixed point. Then $f(x) \ne x$ for all
$x \in B^n$. Let $g : B^n \to S^{n-1}$ be the map

$$g(x) = \frac{x - f(x)}{\|x - f(x)\|}. \tag{4.27}$$

But this is a retraction of the ball onto its boundary, a contradiction.

13 Integration is discussed in detail in Chapter 6.


Additional exercises 

4.8 Homotopic maps are equivalent Show that homotopy is an equivalence
relation on maps, i.e., that for any continuous maps $f, g, h : X \to Y$,

(a) $f \sim f$,

(b) $f \sim g$ implies $g \sim f$, and

(c) $f \sim g$ and $g \sim h$ implies $f \sim h$.

4.9 Homotopy and composition of maps Let $f, g : X \to Y$ and $h : Y \to Z$
be continuous maps. Show that if $f \sim g$ then $h \circ f \sim h \circ g$. Similarly, if
$f : X \to Y$, $g, h : Y \to Z$, and $g \sim h$, show that $g \circ f \sim h \circ f$.

4.10 Homotopic spaces are equivalent Show that homotopy is an equivalence 
relation on spaces, i.e., that for any spaces X, Y, Z, 

(a) X ~ X, 

(b) X ~ Y implies Y ~ X, and 

(c) X ~ Y and Y ~ Z implies X ~ Z. 

14 Technically, we will only prove Brouwer’s theorem for smooth maps. Fortunately, a similar argument works
in the continuous case by the use of singular homology (see Section 5.2) rather than de Rham cohomology.

4.11 The fundamental group Let $X$ be a topological space, and let $\gamma$ be a
path in $X$ from $p$ to $q$, namely a (continuous) parameterized curve $\gamma :
[0, 1] \to X$ with $p := \gamma(0)$ and $q := \gamma(1)$. The points $p$ and $q$ are
the starting point and ending point, respectively. Given two paths $\gamma_1 :
[0, 1] \to X$ and $\gamma_2 : [0, 1] \to X$ with the property that one ends where
the other starts, i.e., $\gamma_1(1) = \gamma_2(0)$, their product $\gamma := \gamma_1\gamma_2$ is just the
concatenation of $\gamma_1$ and $\gamma_2$, rescaled so that the parameter domain remains
$[0, 1]$. Explicitly,

$$
\gamma(t) = \begin{cases} \gamma_1(2t), & \text{if } 0 \le t \le 1/2, \text{ and} \\ \gamma_2(2t - 1) & \text{if } 1/2 \le t \le 1. \end{cases}
$$

Figure 4.7 The product of two paths.

(See Figure 4.7.) The inverse path $\gamma^{-1}$ is just the path that goes backwards
along $\gamma$ from $q$ to $p$: $\gamma^{-1}(t) = \gamma(1 - t)$. The identity path at $p$, $\mathrm{id}_p$, is the
path $\mathrm{id}_p(t) = p$ for all $t \in [0, 1]$.
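The three operations on paths just defined translate almost word for word into code. The following Python sketch is meant only as an illustration: paths are modeled as ordinary functions on $[0, 1]$, and the names `product`, `inverse`, `identity`, as well as the sample paths `east` and `north` in the plane, are assumptions made for the example.

```python
def product(gamma1, gamma2):
    """Concatenate two paths [0,1] -> X, assuming gamma1(1) == gamma2(0)."""
    def gamma(t):
        # First half of the parameter interval runs along gamma1, second half along gamma2.
        return gamma1(2 * t) if t <= 0.5 else gamma2(2 * t - 1)
    return gamma

def inverse(gamma):
    """The same path traversed backwards."""
    return lambda t: gamma(1 - t)

def identity(p):
    """The constant path at the point p."""
    return lambda t: p

# Illustrative paths in the plane: along the x-axis, then straight up.
def east(t):
    return (t, 0.0)

def north(t):
    return (1.0, t)

corner = product(east, north)        # from (0, 0) via (1, 0) to (1, 1)
back = inverse(corner)               # from (1, 1) back to (0, 0)
round_trip = product(corner, back)   # a loop based at (0, 0)
print(corner(0.25), corner(0.75), round_trip(0.5))
```

Observe that `round_trip` starts and ends at the origin, yet halfway through it sits at $(1, 1)$, so as a function it is certainly not the constant path at the origin.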

Two paths $\alpha$ and $\beta$ from $p$ to $q$ are path homotopic (denoted $\alpha \sim \beta$) if
there is a homotopy $F : [0, 1] \times [0, 1] \to X$ from one to the other preserving
the endpoints, i.e., such that

$$F(0, t) = \alpha(t), \qquad F(1, t) = \beta(t),$$

with $F(s, 0) = p$ and $F(s, 1) = q$ for all $s \in [0, 1]$. (See Figure 4.8.)

Figure 4.8 A path-homotopy from $\alpha$ to $\beta$.

(a) Show that path-homotopy is an equivalence relation.

(b) Define the natural product on path-homotopy equivalence classes, namely
$[\alpha][\beta] := [\alpha\beta]$ (whenever $\alpha\beta$ is defined). Show that this is well defined,
namely, independent of class representative.

(c) Define the inverse of a path-homotopy equivalence class by $[\alpha]^{-1} :=
[\alpha^{-1}]$. Verify that this, too, is well defined.
(d) A loop based at $p$ is a path that starts and ends at $p$. Denote by $\pi_1(X, p)$
the set of all path-homotopy equivalence classes of loops based at $p$. Show that
$\pi_1(X, p)$ is a group under the operations defined above, where the identity
element is naturally $[\mathrm{id}_p]$. The group $\pi_1(X, p)$ is called the fundamental
group or first homotopy group of $X$. It is one of the most important
topological invariants of a space. (For more on the fundamental group see
e.g. [55].) Hint: Show that all four group axioms hold (cf. Appendix A).
Be careful. You are probably so used to thinking that $gg^{-1} = e$ that you
might just assume that $\alpha\alpha^{-1} = \mathrm{id}_p$, but this is not true. If $\alpha$ is a nontrivial
loop based at $p$ then $\alpha\alpha^{-1}$ is a loop that travels from $p$ around
$\alpha$ to $p$ then back along $\alpha^{-1}$ to $p$ again. This is not the same thing as
just standing in place. However, it is true that $[\alpha][\alpha]^{-1} = [\mathrm{id}_p]$, because
we just shrink everything back to $p$. Similarly, $(\alpha\beta)\gamma \ne \alpha(\beta\gamma)$ in general
(because the parameterizations are different), but you should show
that $([\alpha][\beta])[\gamma] = [\alpha]([\beta][\gamma])$. Associativity is actually the hardest part,
because the parameterizations are a bit tricky. Here is something to get
you started. Consider the map

$$
F(s, t) = \begin{cases}
\alpha(4t/(s + 1)) & \text{if } 0 \le t \le (s + 1)/4, \\
\beta(4t - (s + 1)) & \text{if } (s + 1)/4 \le t \le (s + 2)/4, \text{ and} \\
\gamma(?) & \text{if } (s + 2)/4 \le t \le 1.
\end{cases}
$$

Fill in the question mark, and figure out a use for $F$.

(e) A space $X$ is path connected if any two points in $X$ can be joined by a
path in $X$. Show that, if $X$ is path connected then $\pi_1(X, p) \cong \pi_1(X, q)$ for
any $p, q \in X$. (In this context, $\cong$ means isomorphic as groups.) Remark:
Topological spaces can be weird. If $X$ is path connected then it is always
connected, but the converse is not true. There are connected spaces that
are not path connected. Look up “topologist’s sine curve”.

(f) (For extra credit) Assume that X and Y are two path connected spaces 
of the same homotopy type. Show that their fundamental groups are 
isomorphic. 

4.12 A continuous map to a contractible space is null homotopic Show that
any continuous map $f : X \to Y$ is null homotopic if $Y$ is contractible. Use
this to show that any continuous but nonsurjective map $f : X \to S^n$ is null
homotopic.

4.13 $k$-Connected spaces A space $X$ is $k$-connected if, for $0 \le i \le k$, any
continuous map $f : S^i \to X$ from the $i$-sphere to $X$ can be extended
to a continuous map $\tilde f : B^{i+1} \to X$ from the $(i + 1)$-ball to $X$, so that
$\tilde f(p) = f(p)$ for all $p \in S^i$. (It is to be assumed that the sphere is the
boundary of the ball.)

(a) Show that $X$ is 0-connected if and only if it is path connected.

(b) Show that $X$ is $k$-connected if and only if every continuous map $f : S^i \to
X$ is null homotopic, for $1 \le i \le k$.

(c) Show that if $X$ is $k$-connected and $Y \sim X$ then $Y$ is $k$-connected.

(d) Show that $S^n$ is not $n$-connected. Remark: One can also show that $S^n$ is
$i$-connected for $1 \le i < n$, but the proof is more difficult. The basic
idea is simple, though. One shows that any continuous map from a lower
dimensional sphere to a higher dimensional one is homotopic to a map
that is not surjective and so, by Exercise 4.12, the original map must be
null homotopic. See e.g. [56], Theorem 4.3.2.

4.14 Cohomology of the torus Show that the cohomology of the 2-torus is
given by

$$
H^k_{dR}(T^2) = \begin{cases}
\mathbb{R}, & k = 0, \\
\mathbb{R} \oplus \mathbb{R}, & k = 1, \\
\mathbb{R}, & k = 2, \\
0, & k > 2.
\end{cases}
$$

Can you identify natural generators in each degree? Hint: Try a covering by
two overlapping cylinders and use Mayer-Vietoris. You will want to invoke
the result discussed in Example 4.4 to show that $\psi^*$ has rank 1 on cohomology.
The key point is to make sure that $\psi^*$ doesn’t kill more cohomology
classes than you might expect.

4.15 Cohomology of the punctured torus Compute the de Rham cohomology
of a punctured torus, namely a torus with a point removed, and show that it is
given by

$$
H^k_{dR}(T'^2) = \begin{cases}
\mathbb{R}, & k = 0, \\
\mathbb{R} \oplus \mathbb{R}, & k = 1, \\
0, & k \ge 2.
\end{cases}
$$

Hint: Let $U = T'^2$ be the torus minus a point $p$, and $V$ an open disk on the
torus containing $p$, and use Mayer-Vietoris. Use the fact that $\psi^*$ is the zero
map on cohomology, which you can prove in Exercise 6.14. Actually, there’s
a really easy way to see that this is the right cohomology: just show that $T'^2$
is homotopic to a bouquet of two circles (two circles joined at a single point)
and then use the result of Exercise 4.18.
4.16 Cohomology of genus-$g$ surfaces Calculate the de Rham cohomology of
a genus-2 surface, then guess the general result for a genus-$g$ surface (which
can be proved by induction). For the definition of “genus” see Exercise 5.9.
Hint: Use the result of the previous exercise and Mayer-Vietoris again.

4.17 The suspension isomorphism Prove the suspension isomorphism
$H^k(M) \cong H^{k+1}(\Sigma M)$, where $\Sigma M$ is the suspension of $M$ and $k \ge 1$. Also,
show that $H^0(M) \cong H^1(\Sigma M) \oplus \mathbb{R}$ and $H^0(\Sigma M) = \mathbb{R}$. Use these results and
the fact that $\Sigma S^n = S^{n+1}$ to compute the cohomology of spheres by induction.
Hint: Let $p$ and $q$ be the two endpoints of the suspension. Let $U = \Sigma M - \{p\}$
and $V = \Sigma M - \{q\}$. Argue that $U$ and $V$ are contractible and that $U \cap V$
is homotopic to $M$. (You needn’t exhibit the homotopies explicitly.) Do not
assume $M$ is connected.

4.18 Cohomology of a wedge sum Let $X$ and $Y$ be two topological spaces, and
suppose that $p \in X$ and $q \in Y$ are distinguished basepoints. The wedge sum
$X \vee Y$ of two topological spaces $X$ and $Y$ (along $p$ and $q$) is $\{X \amalg Y\}/(p \sim
q)$. Show that for two manifolds $M$ and $N$ with distinguished basepoints,
$H^0(M \vee N) \oplus \mathbb{R} \cong H^0(M) \oplus H^0(N)$ and $H^k(M \vee N) \cong H^k(M) \oplus H^k(N)$
for $k > 0$. Hint: The naive thing to do is to set $U = M$ and $V = N$ and use
Mayer-Vietoris. But this will not work, because neither $M$ nor $N$ is open in
$M \vee N$. Instead, let $U_p \subset M$ be an open ball around $p$ and let $V_q \subset N$ be an
open ball around $q$. Define $U = M \cup V_q$ and $V = N \cup U_p$ and go on from
there.


5 

Elementary homology theory 


In these days the angel of topology and the devil of abstract algebra fight 
for the soul of each individual mathematical domain. 

Hermann Weyl †

In order to gain a deeper understanding of smooth manifolds and the de Rham 
theory, we travel in this chapter into the nether reaches of topology. In the process 
we encounter a menagerie of topological invariants and lay the groundwork for 
the following chapter, in which we acquire an understanding of that most basic of 
operations, integration. 


5.1 Simplicial complexes 

A set $C \subseteq \mathbb{R}^n$ is convex if, for every $x, y \in C$, the line segment

$$tx + (1 - t)y, \qquad 0 \le t \le 1, \tag{5.1}$$

is contained entirely in C. In other words, you can walk from any point in a convex 
set to any other along a straight line and never leave the set. See Figure 5.1 for 
some examples of convex and nonconvex sets. The convex hull of a set U, denoted 
conv U, is the smallest convex set containing U. (Figure 5.2.) Equivalently, conv U 
is the intersection of all convex sets containing U . A polytope (Figure 5.3) is the 
convex hull of a finite set of points. 1 

EXERCISE 5.1 Let $S := \{p_1, \dots, p_k\}$ be a finite set of points in $\mathbb{R}^n$. If $p = \sum_i t_i p_i$
with $t_i \ge 0$ and $\sum_i t_i = 1$ then $p$ is a convex combination of the points in $S$. Show
that the convex hull of $S$ is precisely the set of convex combinations of the points in
$S$. Hint: Use induction on $k$.


† Hermann Weyl, Invariants, Duke Math. J. 5:3, 489-502. Copyright 1939, Duke University Press. All rights
reserved. Republished by permission of the copyright holder, Duke University Press, www.dukeupress.edu.

1 For a friendly and comprehensive tour of the world of polytopes, see [92].

Figure 5.1 Convex and nonconvex sets.

Figure 5.2 The convex hull of (a) is (b).

Figure 5.3 Polytopes.


Let $v \in \mathbb{R}^n$ and $a \in \mathbb{R}$, and let $g(\cdot, \cdot)$ be the usual Euclidean metric. Any set of
the form $\{x \in \mathbb{R}^n : g(x, v) = a\}$ is called an affine hyperplane. It determines two
(closed) half-spaces, namely $\{x \in \mathbb{R}^n : g(x, v) \ge a\}$ and $\{x \in \mathbb{R}^n : g(x, v) \le a\}$.
The affine hyperplane $H$ is a supporting hyperplane of an $n$-dimensional polytope
$P$ if $H$ meets $P$ in some nonempty set and $P$ is confined to one of the half-spaces
determined by $H$. (See Figure 5.4.) Any set $F$ of the form $P \cap H$, where $H$ is a
supporting hyperplane of $P$, is called a face of $P$. The definition allows for $F = P$;
all other faces are proper. The dimension of a face is the dimension of the smallest
affine subspace containing it. A zero-dimensional face is a vertex of $P$, a one-dimensional
face is an edge of $P$, and an $(n - 1)$-dimensional face is a facet. The
union of all the facets of a polytope is the boundary of the polytope, and the
interior of $P$ is $P$ minus its boundary. Because polytopes have facets or “seats”,
they are also called polyhedra (“many seats”). 2 It seems obvious that a polytope
is the convex hull of its vertices and that every face is just the convex hull of some
subset of the vertices, but the proofs are surprisingly subtle, so we will omit them. 3

2 Technically, a polyhedron is the intersection of a finite number of half-spaces, and can therefore be unbounded.
To get a polytope we must insist that the polyhedron not contain a ray.

Figure 5.4 A polytope, two of its supporting hyperplanes, and two half-spaces.
The intersections of the hyperplanes with the polytope determine two faces, an
edge and a vertex.

Figure 5.5 Affinely independent and dependent point sets in the plane. (Left: an independent set in $\mathbb{R}^2$. Right: a dependent set in $\mathbb{R}^2$.)

A set of points $S := \{p_0, p_1, \dots, p_d\}$ in $\mathbb{R}^n$ is (affinely) independent if the
vectors $p_1 - p_0, p_2 - p_0, \dots, p_d - p_0$ are linearly independent. (See Figure 5.5.)
A simplex is the convex hull of an independent set of points. We write $[S]$ to
denote the simplex determined by the set $S$. If $|S| = d + 1$, the dimension of $[S]$
is $d$, so we call it a $d$-simplex. A triangle is a 2-simplex and a tetrahedron is a
3-simplex. (See the two polytopes on the left in Figure 5.3.) The vertices of $[S]$ are
precisely the points of $S$. Because a subset of an independent set of points is itself
independent, any proper subset $T \subset S$ determines a lower dimensional simplex
$[T]$, which is necessarily a face of $[S]$. If $|T| = k + 1$, $[T]$ is a $k$-face of $[S]$. The
0-faces are the vertices, the 1-faces are the edges, and the $(d - 1)$-faces are the facets.

EXERCISE 5.2 Let $S = \{p_0, \dots, p_d\}$ be the vertices of a simplex $[S]$. Show that
every point $p$ in $[S]$ can be written uniquely as $\sum_i t_i p_i$ with $t_i \ge 0$ and $\sum_i t_i = 1$.
(The coefficients $t_i$ are called the barycentric coordinates of $p$.)
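To get a feel for barycentric coordinates it may help to compute a few. The following Python sketch treats only the special case of a 2-simplex in the plane, and it assumes that all the points have integer or rational coordinates so that exact arithmetic can be used; the function names `barycentric` and `in_simplex` are hypothetical. It solves $p - p_0 = t_1(p_1 - p_0) + t_2(p_2 - p_0)$ by Cramer's rule and then sets $t_0 = 1 - t_1 - t_2$.

```python
from fractions import Fraction

def barycentric(p, p0, p1, p2):
    """Barycentric coordinates (t0, t1, t2) of p with respect to the triangle
    [p0, p1, p2] in the plane, for points with integer or rational coordinates."""
    ax, ay = p1[0] - p0[0], p1[1] - p0[1]   # p1 - p0
    bx, by = p2[0] - p0[0], p2[1] - p0[1]   # p2 - p0
    cx, cy = p[0] - p0[0], p[1] - p0[1]     # p - p0
    det = ax * by - ay * bx
    if det == 0:
        raise ValueError("the points p0, p1, p2 are not affinely independent")
    t1 = Fraction(cx * by - cy * bx, det)
    t2 = Fraction(ax * cy - ay * cx, det)
    return (1 - t1 - t2, t1, t2)

def in_simplex(p, p0, p1, p2):
    """A point lies in the closed simplex exactly when all its coordinates are >= 0."""
    return all(t >= 0 for t in barycentric(p, p0, p1, p2))

triangle = ((0, 0), (4, 0), (0, 4))
print(barycentric((1, 1), *triangle))
print(in_simplex((1, 1), *triangle), in_simplex((5, 0), *triangle))
```

The determinant test in the middle is exactly the independence condition: if it vanishes, the three points are collinear and the coordinates are no longer unique.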


Remark As we have defined them, (geometric) simplices are closed subsets of
$\mathbb{R}^n$. Given an independent set of points $S = \{p_0, \dots, p_n\}$, one can also define the
open simplex, $(S)$, to be the set of all points of the form $\sum_{i=0}^n t_i p_i$ where $t_i > 0$
and $\sum_i t_i = 1$. If $T \subseteq S$, $(T)$ is an open face of $[S]$. The open simplex $(S)$ is
the topological interior of $[S]$, and $[S]$ is the topological closure of $(S)$. Note that
vertices are both open and closed faces and that a simplex is the (disjoint) union of
its open faces.

3 See e.g. [92], Propositions 2.2 and 2.3, or [57], Proposition 5.3.2.

Figure 5.6 Simplicial complexes and noncomplexes.

Figure 5.7 Oriented 1-simplices: $\langle p_0, p_1 \rangle$ and $\langle p_1, p_0 \rangle$.


At last we come to the main definition. A simplicial complex K is a finite 
collection of simplices satisfying two properties, as follows. 

(1) If $[S] \in K$ and $T \subset S$ then $[T] \in K$. (Every face of a simplex in $K$ is a
simplex in $K$.)

(2) For any $[S_1], [S_2] \in K$, $[S_1] \cap [S_2] = [S_1 \cap S_2]$. (If two simplices meet, they
do so along a mutual face.)

The dimension of K is the maximum dimension of all the simplices it contains. 
Intuitively, a simplicial complex is a set comprising a bunch of simplices that are 
glued together along their faces. (See Figure 5.6.) 

It turns out to be algebraically useful to assign an orientation to a simplicial complex.
An oriented simplex $\sigma = \langle p_0, p_1, \dots, p_n \rangle$ is a simplex $[p_0, p_1, \dots, p_n]$ in
which we have chosen an order for the points, up to an even permutation. That is, all
the orderings of the vertices are divided into precisely two equivalence classes, or orientations,
depending on whether one chooses an even or odd permutation of the vertices of
$[p_0, p_1, \dots, p_n]$.


Example 5.1 Consider the oriented 1-simplex $\langle p_0, p_1 \rangle$. The 1-simplex $\langle p_1, p_0 \rangle$
has the opposite orientation. We can represent these two simplices as in Figure 5.7.

Figure 5.8 Oriented 2-simplices: $\langle p_0, p_1, p_2 \rangle$ and $\langle p_2, p_1, p_0 \rangle$.

Example 5.2 The ordered collections $\langle p_0, p_1, p_2 \rangle$, $\langle p_1, p_2, p_0 \rangle$, and $\langle p_2, p_0, p_1 \rangle$
represent the same oriented 2-simplex, whereas $\langle p_0, p_2, p_1 \rangle$, $\langle p_2, p_1, p_0 \rangle$, and
$\langle p_1, p_0, p_2 \rangle$ represent another oriented 2-simplex. The two oriented simplices have
opposite orientations. We can represent them as in Figure 5.8.
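Deciding whether two orderings of the vertices give the same orientation amounts to computing the parity of a permutation. Here is a small Python sketch of one way to do this, by decomposing the permutation into cycles; the function name `permutation_sign` and the string labels for the vertices are illustrative assumptions.

```python
def permutation_sign(ordering, reference):
    """+1 if `ordering` is an even permutation of `reference`, and -1 if it is odd."""
    position = {vertex: i for i, vertex in enumerate(reference)}
    perm = [position[vertex] for vertex in ordering]
    sign, seen = 1, [False] * len(perm)
    for start in range(len(perm)):
        # Walk around the cycle containing `start`, counting its length.
        length, j = 0, start
        while not seen[j]:
            seen[j] = True
            j = perm[j]
            length += 1
        if length > 0 and length % 2 == 0:
            sign = -sign   # a cycle of even length is an odd permutation
    return sign

reference = ("p0", "p1", "p2")
orderings = [
    ("p0", "p1", "p2"), ("p1", "p2", "p0"), ("p2", "p0", "p1"),
    ("p0", "p2", "p1"), ("p2", "p1", "p0"), ("p1", "p0", "p2"),
]
signs = [permutation_sign(o, reference) for o in orderings]
print(signs)
```

The first three orderings come out with sign $+1$ and the last three with sign $-1$, which is the division into two orientations described in the example.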

Let $K$ be an oriented simplicial complex, namely a simplicial complex in which
each simplex has been assigned an orientation. An $m$-chain on $K$ with coefficients
in $\mathbb{R}$ is a formal linear combination of the oriented $m$-simplices in $K$ with
coefficients in $\mathbb{R}$:

$$c = \sum_i a_i \sigma_i \qquad (a_i \in \mathbb{R}), \tag{5.2}$$

subject to the convention that if $\sigma_i$ and $\sigma_j$ represent oppositely oriented simplices
then $\sigma_i = -\sigma_j$. The vector space of all $m$-chains on $K$ is denoted $C_m(K)$.

Let $\sigma = \langle p_0, p_1, \dots, p_n \rangle$ be an oriented $n$-simplex. The boundary $\partial\sigma$ of $\sigma$ is
defined to be the $(n - 1)$-chain

$$\partial\sigma = \sum_{j=0}^{n} (-1)^j \langle p_0, \dots, \hat{p}_j, \dots, p_n \rangle, \tag{5.3}$$

where the caret means that the vertex $p_j$ is deleted. (See Figure 5.9.) The boundary
operator

$$\partial : C_m(K) \to C_{m-1}(K) \tag{5.4}$$

is obtained as the unique linear extension of the boundary map on simplices. Actually,
there is one boundary operator for each $m$. When we wish to distinguish them,
we write $\partial_m$ for the boundary operator whose domain is $C_m(K)$.

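Example 5.3 We have

$$
\begin{aligned}
\partial^2 \langle p_0, p_1, p_2 \rangle &= \partial \{ \langle p_1, p_2 \rangle + \langle p_2, p_0 \rangle + \langle p_0, p_1 \rangle \} \\
&= p_2 - p_1 + p_0 - p_2 + p_1 - p_0 \\
&= 0.
\end{aligned}
$$

Figure 5.9 The boundary operator in action: $\partial \langle p_0, p_1 \rangle = p_1 - p_0$ and $\partial \langle p_0, p_1, p_2 \rangle = \langle p_1, p_2 \rangle - \langle p_0, p_2 \rangle + \langle p_0, p_1 \rangle$.

We now arrive at a very simple but surprisingly powerful theorem.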


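Theorem 5.1 The operator $\partial^2 = 0$, i.e., the boundary of a boundary is zero.

Proof The general proof is similar to that of Example 5.3, in that it uses pairwise
cancellation. The only tricky thing is keeping track of the summations:

$$
\begin{aligned}
\partial[\partial \langle p_0, \dots, p_{m+1} \rangle]
&= \partial \sum_{j=0}^{m+1} (-1)^j \langle p_0, \dots, \hat{p}_j, \dots, p_{m+1} \rangle \\
&= \sum_{j=0}^{m+1} (-1)^j \Bigg[ \sum_{k=0}^{j-1} (-1)^k \langle p_0, \dots, \hat{p}_k, \dots, \hat{p}_j, \dots, p_{m+1} \rangle \\
&\qquad\qquad\qquad + \sum_{k=j+1}^{m+1} (-1)^{k-1} \langle p_0, \dots, \hat{p}_j, \dots, \hat{p}_k, \dots, p_{m+1} \rangle \Bigg] \\
&= \sum_{k<j} (-1)^{j+k} \langle p_0, \dots, \hat{p}_k, \dots, \hat{p}_j, \dots, p_{m+1} \rangle \\
&\qquad + \sum_{j<k} (-1)^{j+k-1} \langle p_0, \dots, \hat{p}_j, \dots, \hat{p}_k, \dots, p_{m+1} \rangle \\
&= 0.
\end{aligned}
$$

(For the last equality, flip the dummy indices $j$ and $k$ in the second term.)

□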
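The pairwise cancellation in the proof is easy to watch on a computer. The following Python sketch implements formula (5.3) under one simplifying assumption that is ours, not the text's: every simplex is stored as a tuple of vertex labels in increasing order, which fixes its orientation once and for all, and a chain is a dictionary from such tuples to coefficients. The function name `boundary` is likewise just a label chosen for the example.

```python
from collections import defaultdict

def boundary(chain):
    """Boundary of a chain, given as a dict {vertex tuple: coefficient}.

    Each simplex is stored with its vertices in increasing order, which fixes
    its orientation; deleting a vertex keeps the remaining ones in order."""
    result = defaultdict(int)
    for simplex, coeff in chain.items():
        if len(simplex) == 1:
            continue                      # a single vertex has no boundary
        for j in range(len(simplex)):
            face = simplex[:j] + simplex[j + 1:]   # delete the j-th vertex
            result[face] += (-1) ** j * coeff
    # Drop the simplices whose coefficients have cancelled.
    return {face: c for face, c in result.items() if c != 0}

triangle = {(0, 1, 2): 1}
tetrahedron = {(0, 1, 2, 3): 1}

edges = boundary(triangle)
print(edges)
print(boundary(edges))
print(boundary(boundary(tetrahedron)))
```

The empty dictionary represents the zero chain, so the last two lines print `{}`: every vertex (respectively, edge) appears twice with opposite signs, just as in the proof.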


5.2 Homology 

The utility of simplicial complexes in topology is that they provide simple and
tractable models of various spaces. More precisely, a triangulation of a topological
space $X$ is a simplicial complex $K$ that is homeomorphic to $X$. For instance,
$\sigma = \langle p_0, \dots, p_{n+1} \rangle$ is a triangulation of the $(n + 1)$-ball $B^{n+1}$, while its boundary
$\partial\sigma$ is a natural triangulation of the $n$-sphere $S^n$. 4 If a triangulation of $X$ exists, $X$
is said to be triangulable. It is perhaps not too surprising that there are topological
spaces that are not triangulable, because general topological spaces can be quite
complicated. What is really strange is that there are topological manifolds (e.g., the
so-called E8-space) that are not triangulable either. However, all smooth manifolds
can be triangulated in a particularly nice way (see Chapter 6), and in that case the
availability of a triangulation of a manifold makes it significantly easier to ascertain
its topology. Before returning to the land of smooth manifolds, however, it is worth
spending a little time in the land of merely continuous manifolds in order to better
understand the idea of “homology”. We begin by discussing simplicial homology.

An $m$-chain $c$ with no boundary ($\partial c = 0$) is called an $m$-cycle. The space of all
$m$-cycles on $K$ is labeled $Z_m(K)$. 5 An $m$-chain $c$ that is the boundary of an $(m + 1)$-chain
$c'$ ($c = \partial c'$) is called an $m$-boundary, and the space of all $m$-boundaries on $K$ is
labeled $B_m(K)$. Two $m$-cycles whose difference is an
$m$-boundary are said to be homologous. We write

$$z_1 \sim z_2 \quad \text{when} \quad z_1 - z_2 = \partial c. \tag{5.5}$$

The equivalence class of all $m$-cycles homologous to a fixed $m$-cycle $z$ is written
$[z]$. The set of all such equivalence classes is the $m$th simplicial homology group,
denoted $H_m(K)$. In other words, the $m$th simplicial homology group is just the
quotient space of cycles modulo boundaries:

$$H_m(K) = Z_m(K)/B_m(K). \tag{5.6}$$

Although it is not obvious at this stage, the homology of $K$ is independent of
the orientation chosen for its computation; it depends only on the underlying
complex $K$. 6

Because we are dealing here with only real-valued quantities, $H_m(K)$ is a vector
space. 7 All vector spaces of the same dimension are the same (isomorphic), so the
only relevant information contained in $H_m(K)$ is its dimension $\beta_m = \dim H_m(K)$,
called the $m$th Betti number of $K$. As we shall see, $\beta_m(K)$ basically counts the
number of $m$-dimensional “holes” of $K$.
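Since $\beta_m = \dim Z_m(K) - \dim B_m(K)$, and since $\dim Z_m(K) = \dim C_m(K) - \operatorname{rk} \partial_m$ and $\dim B_m(K) = \operatorname{rk} \partial_{m+1}$, the Betti numbers of a small complex can be found with nothing more than Gaussian elimination. The following Python sketch does this under several assumptions of our own: a complex is specified by listing its maximal simplices as tuples of integer vertex labels, every simplex is oriented by writing its vertices in increasing order, and the helper names (`rank`, `closure`, `boundary_matrix`, `betti_numbers`) are hypothetical.

```python
from fractions import Fraction
from itertools import combinations

def rank(matrix):
    """Rank of a matrix (a list of rows), by Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    r = 0
    for col in range(len(rows[0]) if rows else 0):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def closure(top_simplices):
    """All faces of the given simplices, as a list K with K[m] the sorted m-simplices."""
    faces = set()
    for s in top_simplices:
        s = tuple(sorted(s))
        for k in range(1, len(s) + 1):
            faces.update(combinations(s, k))
    dim = max(len(f) for f in faces) - 1
    return [sorted(f for f in faces if len(f) == m + 1) for m in range(dim + 1)]

def boundary_matrix(simplices_m, simplices_m1):
    """Matrix of the boundary operator C_m -> C_{m-1}, following formula (5.3).
    Rows are indexed by (m-1)-simplices and columns by m-simplices."""
    index = {s: i for i, s in enumerate(simplices_m1)}
    mat = [[0] * len(simplices_m) for _ in simplices_m1]
    for col, s in enumerate(simplices_m):
        for j in range(len(s)):
            mat[index[s[:j] + s[j + 1:]]][col] = (-1) ** j
    return mat

def betti_numbers(top_simplices):
    """Betti numbers [beta_0, beta_1, ...] of the complex generated by the given simplices."""
    K = closure(top_simplices)
    # ranks[m] is the rank of the boundary operator on C_m; it is zero for m = 0
    # and for m above the dimension of the complex.
    ranks = [0] + [rank(boundary_matrix(K[m], K[m - 1])) for m in range(1, len(K))] + [0]
    return [len(K[m]) - ranks[m] - ranks[m + 1] for m in range(len(K))]

hollow_triangle = [(0, 1), (1, 2), (0, 2)]               # a triangulated circle
solid_triangle = [(0, 1, 2)]                             # a triangulated disk
hollow_tetrahedron = list(combinations(range(4), 3))     # a triangulated 2-sphere

print(betti_numbers(hollow_triangle))
print(betti_numbers(solid_triangle))
print(betti_numbers(hollow_tetrahedron))
```

The hollow triangle has one component and one one-dimensional hole, filling it in removes the hole, and the hollow tetrahedron has no one-dimensional hole but one two-dimensional one.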

There is an evident similarity between simplicial homology and de Rham coho- 
mology, although the precise connection between them is certainly not clear at 
this stage. In order to forge a link, we first discuss yet another homology theory, 

4 When comparing topologies we just forget about the orientation of the complex. 

5 “Z” for the German “Zykel”. 

6 See e.g. ([75], Corollary 7.25). 

7 Technically, we ought to write $H_m(K, \mathbb{F})$ instead of $H_m(K)$, to emphasize that our chains are defined with
coefficients chosen from a field $\mathbb{F}$. When the field $\mathbb{F}$ is replaced by a general abelian group $G$, $H_m(K, G)$
contains additional information called torsion (not to be confused with another kind of torsion to be introduced
in Section 8.2). For more information, consult [75].
Figure 5.10 A singular 2-simplex.

called the singular homology of a space $X$ and denoted $H_\bullet(X)$. In view of our
eventual destination (namely smooth manifolds), we will proceed along somewhat 
unconventional lines although, of course, we will end up with exactly the same 
theory. 

Intuitively, a singular $n$-simplex in $X$ is just a continuous map $\phi : \sigma^n \to X$
from an $n$-simplex to $X$. The problem is that, starting from different $n$-simplices,
we could end up with the same image, and in that case we would really like to say
the two singular simplices are the same. In the conventional approach one avoids
this problem by restricting $\sigma^n$ to some kind of standard simplex, but we will find it
convenient not to do this. Instead, if $\phi : \sigma^n \to X$ is a continuous map, we define
a singular $n$-simplex to be an equivalence class $(\sigma^n, \phi)$, where two pairs $(\sigma^n, \phi)$
and $(\tau^n, \psi)$ are in the same class whenever $\phi(\sigma^n) = \psi(\tau^n)$. Equivalently, if $f : \sigma^n \to \tau^n$
is the natural order-preserving affine map, then $\phi = \psi \circ f$. (See Figure 5.10.)

A singular $n$-chain with coefficients in $\mathbb{R}$ is a formal linear combination of the
singular $n$-simplices in $X$, and the space of all such chains is denoted $S_n(X)$. To
define a homology theory on $X$ we need an analogue of the boundary operator. Let
$\sigma = (\bar\sigma, \phi)$, where $\bar\sigma$ is the underlying rigid simplex. Suppose that
$\partial\bar\sigma = \sum_i \pm\bar\sigma_i$, where $\bar\sigma_i$ is a facet of $\bar\sigma$. We define the


8 The map $\phi$ is not required to be injective, so singular simplices may not look much like their rigid counterparts.
In particular, the image of a simplex could be a lower-dimensional object such as a point; hence the word
“singular”.
Figure 5.11 A triangulation of the circle.

boundary of $\sigma$ to be $\partial\sigma := \sum_i \pm\sigma_i$, where $\sigma_i = (\bar\sigma_i, \phi)$, and extend it linearly to
singular $n$-chains. We automatically obtain $\partial^2 = 0$, whereupon we simply repeat
everything all over again. The singular $n$-cycles are $Z_n(X) := \ker \partial|_{S_n(X)}$, the singular
$n$-boundaries are $B_n(X) := \operatorname{im} \partial|_{S_{n+1}(X)}$, and the singular $n$th homology
$H_n(X)$ is $Z_n(X)/B_n(X)$. With a little work one can show that $H_n(X)$ is a homotopy
invariant and therefore a topological invariant.

As you might have guessed, there is a relation between the two kinds of 
homology we have discussed. If $K$ is any triangulation of $X$ then⁹

$$H_\bullet(K) = H_\bullet(X). \tag{5.7}$$

This is good, because computations in the land of singular chains are usually quite 
involved, whereas computations in the land of simplicial complexes are relatively 
straightforward (at least in principle). 

Example 5.4 Here is a simple example to illustrate the computation of simplicial
homology. The circle $S^1$ admits a triangulation $K$, as illustrated in Figure 5.11.
(One can think of the map $\varphi$ as the formal linear combination of three singular
1-simplices.)

We want to compute the homology of the circle, which we can do by computing the
Betti numbers of $K$. The homologies are all of the form $H_k = \mathbb{R}^{\beta_k}$, where $\beta_k$ is the
$k$th Betti number. As $Z_k = \ker\partial_k$, $B_k = \operatorname{im}\partial_{k+1}$, and $H_k = Z_k/B_k$, Theorem 1.4
gives

$$\beta_k = \dim H_k = \dim\ker\partial_k - \dim\operatorname{im}\partial_{k+1}.$$

Thus, to find the homologies of the triangle $K$, we just need to compute the ranks
and nullities of the boundary maps $\partial_k$.

Since $K$ consists of a single oriented triangle $(p_0, p_1, p_2)$, every 0-chain on $K$ is of
the form

$$c_0 = a_0 p_0 + a_1 p_1 + a_2 p_2,$$

for some scalars $a_i$. The boundary of a point is zero, so $\partial_0 c_0 = 0$. That is, $\partial_0$ is the
zero map. In particular, $\dim\ker\partial_0 = \dim C_0 = 3$.

The most general 1-chain is

$$c_1 = b_0 (p_1, p_2) + b_1 (p_2, p_0) + b_2 (p_0, p_1),$$

9 The proof of this is not too difficult, but we shall not give it here. See e.g. [37] or [75]. 
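for some scalars $b_i$. Its boundary is

$$\begin{aligned}
\partial_1 c_1 &= b_0 (p_2 - p_1) + b_1 (p_0 - p_2) + b_2 (p_1 - p_0) \\
&= (b_1 - b_2) p_0 + (b_2 - b_0) p_1 + (b_0 - b_1) p_2.
\end{aligned}$$

Thus $c_0 = \partial_1 c_1$ if

$$a_0 = b_1 - b_2, \qquad a_1 = b_2 - b_0, \qquad a_2 = b_0 - b_1,$$

or

$$\begin{pmatrix} 0 & 1 & -1 \\ -1 & 0 & 1 \\ 1 & -1 & 0 \end{pmatrix}
\begin{pmatrix} b_0 \\ b_1 \\ b_2 \end{pmatrix} =
\begin{pmatrix} a_0 \\ a_1 \\ a_2 \end{pmatrix}. \tag{5.8}$$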


The matrix in (5.8) represents the boundary map $\partial_1$. The first two rows of that matrix
are linearly independent but the sum of all three rows is zero, so it has rank 2.
Therefore

$$\beta_0 = \dim\ker\partial_0 - \dim\operatorname{im}\partial_1 = 3 - 2 = 1.$$

Observe that $\dim\ker\partial_1 = 3 - 2 = 1$, which follows from the rank-nullity theorem.

There are no 2-chains in $K$, so no nonzero 1-cycle is the boundary of anything. Hence
$\dim\operatorname{im}\partial_2 = 0$, which implies that

$$\beta_1 = \dim\ker\partial_1 - \dim\operatorname{im}\partial_2 = 1 - 0 = 1.$$


All the higher degree homologies vanish because there are no k-chains for k > 1. 
Summarizing, we have 


$$H_k(S^1) = \begin{cases} \mathbb{R}, & k = 0 \text{ or } k = 1, \text{ and} \\ 0 & \text{otherwise.} \end{cases} \tag{5.9}$$


Example 5.5 Just for kicks, let’s do one more example. This time we want to 
compute the homology of the two-dimensional disk (2-ball) $D^2$. The disk is homeomorphic
to the complex $K$ consisting of a triangle together with its interior.
Proceeding as before, we find $\beta_0 = 1$ again. But $\beta_1$ is different, for now there is
a nontrivial 2-chain, namely $K$ itself: $c_2 = (p_0, p_1, p_2)$. It follows that $\partial_2$ has rank
1, because it maps $C_2(K)$ to a one-dimensional subspace of $C_1(K)$ spanned by the
chain

$$(p_0, p_1) + (p_1, p_2) + (p_2, p_0).$$

Hence $\beta_1 = \dim\ker\partial_1 - \dim\operatorname{im}\partial_2 = 1 - 1 = 0$. Lastly, what of $H_2$? The 2-chain
$a(p_0, p_1, p_2)$ is a 2-cycle if it has no boundary, but we just saw that it does have a
boundary unless $a = 0$. So there are no nonzero 2-cycles, and $H_2 = 0$. It follows that

$$H_k(D^2) = \begin{cases} \mathbb{R}, & k = 0, \text{ and} \\ 0 & \text{otherwise.} \end{cases} \tag{5.10}$$


This differs from $H_\bullet(S^1)$, so the disk and its boundary are not homeomorphic, a
result we previously obtained using de Rham cohomology.
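
The rank computations in Examples 5.4 and 5.5 can be checked mechanically. The following is a minimal sketch, not part of the original text: it assumes each simplex is stored as an increasing tuple of vertex labels (so the edge orientations differ from those chosen in Example 5.4, which does not affect the homology), and the helper names `rank`, `boundary_matrix`, and `betti_numbers` are invented for this illustration. It uses exact rational arithmetic so that no rounding enters the rank.

```python
from fractions import Fraction


def rank(matrix):
    """Rank of a matrix (a list of rows) by exact Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    r = 0  # number of pivots found so far
    for col in range(n_cols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def boundary_matrix(k_simplices, faces):
    """Matrix of the boundary map: one column per k-simplex, one row per (k-1)-simplex."""
    row_of = {face: i for i, face in enumerate(faces)}
    matrix = [[0] * len(k_simplices) for _ in faces]
    for j, simplex in enumerate(k_simplices):
        for omit in range(len(simplex)):
            # Dropping the vertex in position `omit` contributes the sign (-1)^omit.
            face = simplex[:omit] + simplex[omit + 1:]
            matrix[row_of[face]][j] = (-1) ** omit
    return matrix


def betti_numbers(simplices):
    """simplices[k] lists the k-simplices as increasing tuples of vertex labels."""
    top = len(simplices) - 1
    ranks = [0] * (top + 2)  # ranks[k] = rank of the k-th boundary map (zero for k = 0 and k = top + 1)
    for k in range(1, top + 1):
        ranks[k] = rank(boundary_matrix(simplices[k], simplices[k - 1]))
    # beta_k = dim ker d_k - dim im d_{k+1} = (dim C_k - rank d_k) - rank d_{k+1}
    return [len(simplices[k]) - ranks[k] - ranks[k + 1] for k in range(top + 1)]


circle = [[(0,), (1,), (2,)], [(0, 1), (0, 2), (1, 2)]]  # hollow triangle
disk = circle + [[(0, 1, 2)]]                             # triangle with its interior

print(betti_numbers(circle))  # [1, 1]
print(betti_numbers(disk))    # [1, 0, 0]
```

The two printed lists agree with (5.9) and (5.10): adding the single 2-simplex raises the rank of $\partial_2$ from 0 to 1 and thereby kills the 1-dimensional hole.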


EXERCISE 5.3 The 2-sphere $S^2$ can be triangulated by a simplicial complex consisting
of four triangles homeomorphic to the surface of a tetrahedron. Call this complex
$K$. By computing the simplicial homology of $K$, show that

$$H_k(S^2) = \begin{cases} \mathbb{R}, & k = 0 \text{ or } 2, \text{ and} \\ 0 & \text{otherwise.} \end{cases}$$

Generalize to the $n$-sphere $S^n$. (Don’t do the computation for $S^n$; just guess the
analogous answer. Of course you could always use Exercise 4.7 together with de
Rham’s theorem from Chapter 6.)


5.3 The Euler characteristic 

As one might imagine from the previous examples, even the computation of simplicial
homology can become prohibitively difficult if the approximating complex
has many simplices.¹⁰ Thus it would be nice to have an even simpler invariant at
hand. The Euler characteristic is such an invariant. It is the simplest, and there- 
fore the most fundamental, topological invariant of a space. It satisfies all kinds of 
intuitive properties, and it is often easy and fun to compute. 

The Euler characteristic $\chi(X)$ of a topological space $X$ is the integer

$$\chi(X) = \sum_{i=0}^{\dim X} (-1)^i \beta_i, \tag{5.11}$$

where $\beta_i = \dim H_i(X)$ is the $i$th Betti number of $X$.

Example 5.6 The Betti numbers of the $n$-sphere are

$$\beta_k(S^n) = \begin{cases} 1 & \text{if } k = 0 \text{ or } n, \text{ and} \\ 0 & \text{otherwise.} \end{cases}$$


10 This is the reason why Whitehead invented the theory of CW complexes. A CW complex is a prescription for 
gluing together cells - essentially topological balls - according to certain rules. Typically a space needs many 
fewer cells than simplices for its description, so the homology theory becomes a bit simpler. See [75]. 
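[Figure 5.12 shows five simplicial complexes with their Euler characteristics: a point, $\chi = 1$; a line segment, $\chi = 2 - 1 = 1$; a hollow triangle, $\chi = 3 - 3 = 0$; a filled triangle, $\chi = 3 - 3 + 1 = 1$; and a fifth complex with $\chi = 4 - 5 + 1 = 0$.]

Figure 5.12 Euler characteristics of some simplicial complexes.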


The Euler characteristic of the n -sphere is therefore 

$$\chi(S^n) = 1 + (-1)^n.$$

In particular, the Euler characteristic vanishes for odd-dimensional spheres and 
equals 2 for even-dimensional spheres. 


If X admits a triangulation by a simplicial complex K then it suffices to compute 
the Euler characteristic of K . For that, we use the following pretty theorem. 

Theorem 5.2 Let $K$ be an oriented simplicial complex. For each $j$, with $0 \le j \le \dim K$,
let $\alpha_j$ denote the number of $j$-simplices of $K$ (the number of $j$-faces of
$K$). Then

$$\chi(K) = \sum_{j=0}^{\dim K} (-1)^j \alpha_j. \tag{5.12}$$

This is the form of the Euler characteristic that you may recall from high
school: vertices minus edges plus 2-faces, etc.¹¹ Some examples are provided in
Figure 5.12.

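Proof The simplicial complex $K$ induces a sequence of chain spaces
(called, appropriately enough, a chain complex),

$$\cdots \longrightarrow C_{m+1}(K) \xrightarrow{\ \partial_{m+1}\ } C_m(K) \xrightarrow{\ \partial_m\ } C_{m-1}(K) \longrightarrow \cdots$$

where $C_{-1}(K)$ is the zero space, by definition. By the rank-nullity theorem,

$$\begin{aligned}
\alpha_m &= \dim C_m \\
&= \dim\ker\partial_m + \dim\operatorname{im}\partial_m \\
&= \dim Z_m + \dim B_{m-1}.
\end{aligned}$$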


11 In fact, you may recall that this formula applies more generally to polytopes whose faces are not simplices.
That generalization follows simply by triangulating the faces of the polytope.
But, as we saw in Example 5.4,

$$\beta_m = \dim Z_m - \dim B_m.$$

Thus,

$$\begin{aligned}
\chi(K) &= \sum_{m=0}^{\dim K} (-1)^m \beta_m \\
&= \sum_{m=0}^{\dim K} (-1)^m (\dim Z_m - \dim B_m) \\
&= \sum_{m=0}^{\dim K} (-1)^m \dim Z_m - \sum_{m=0}^{\dim K} (-1)^m \dim B_m \\
&= \sum_{m=0}^{\dim K} (-1)^m \dim Z_m + \sum_{m=1}^{\dim K} (-1)^m \dim B_{m-1}
\qquad \text{(because $B_m = 0$ for $m \ge \dim K$)} \\
&= \sum_{m=0}^{\dim K} (-1)^m (\dim Z_m + \dim B_{m-1})
\qquad \text{(because $B_{-1} = 0$)} \\
&= \sum_{m=0}^{\dim K} (-1)^m \alpha_m.
\end{aligned}$$

□
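
As a sanity check on Theorem 5.2, one can compute the Euler characteristic both ways for a few small complexes. The sketch below is only an illustration under stated assumptions: a complex is specified by a list of facets (maximal simplices) given as tuples of vertex labels, and the helper names `closure`, `rank`, `boundary_rank`, and `euler_two_ways` are invented here. The third test complex is a hypothetical one chosen to have four vertices, five edges, and one triangle, matching the counts $4 - 5 + 1$ in Figure 5.12.

```python
from fractions import Fraction
from itertools import combinations


def closure(facets):
    """All nonempty faces of the given facets, grouped by dimension and sorted."""
    top = max(len(f) for f in facets) - 1
    levels = [set() for _ in range(top + 1)]
    for facet in facets:
        for size in range(1, len(facet) + 1):
            levels[size - 1].update(combinations(sorted(facet), size))
    return [sorted(level) for level in levels]


def rank(matrix):
    """Rank of a matrix (a list of rows) by exact Gaussian elimination over the rationals."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    n_cols = len(rows[0]) if rows else 0
    r = 0
    for col in range(n_cols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][col] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][col] / rows[r][col]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r


def boundary_rank(k_simplices, faces):
    """Rank of the boundary map from k-chains to (k-1)-chains."""
    row_of = {face: i for i, face in enumerate(faces)}
    matrix = [[0] * len(k_simplices) for _ in faces]
    for j, simplex in enumerate(k_simplices):
        for omit in range(len(simplex)):
            matrix[row_of[simplex[:omit] + simplex[omit + 1:]]][j] = (-1) ** omit
    return rank(matrix)


def euler_two_ways(facets):
    """Return (face counts, Betti numbers, chi from counts, chi from Betti numbers)."""
    K = closure(facets)
    top = len(K) - 1
    counts = [len(level) for level in K]  # alpha_0, alpha_1, ...
    ranks = [0] + [boundary_rank(K[k], K[k - 1]) for k in range(1, top + 1)] + [0]
    betti = [counts[k] - ranks[k] - ranks[k + 1] for k in range(top + 1)]
    chi_from_counts = sum((-1) ** k * a for k, a in enumerate(counts))
    chi_from_betti = sum((-1) ** k * b for k, b in enumerate(betti))
    return counts, betti, chi_from_counts, chi_from_betti


hollow_triangle = [(0, 1), (1, 2), (0, 2)]
filled_triangle = [(0, 1, 2)]
triangle_with_loop = [(0, 1, 2), (1, 3), (2, 3)]  # hypothetical 4-5-1 complex

for facets in (hollow_triangle, filled_triangle, triangle_with_loop):
    print(euler_two_ways(facets))
```

In each case the last two entries of the returned tuple coincide, as the theorem asserts; counting faces requires no linear algebra at all, which is the practical point of (5.12).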


Additional exercises 


5.4 Perron-Frobenius theorem Let A be a matrix all of whose entries are 
positive real numbers. Prove that A has a positive real eigenvector (one 
with all positive entries) with positive real eigenvalue. Hint: Let $\sigma^{n-1}$ be the
$(n-1)$-simplex in $\mathbb{R}^n$ given by $\sum_i x^i = 1$ and $x^i \ge 0$. Consider the map
$f : \sigma^{n-1} \to \mathbb{R}^n$ given by

$$f(x) = \frac{Ax}{|Ax|},$$

where $x = (x^1, \dots, x^n)$ and $|x| = \sum_i |x^i|$ is the so-called “$L^1$ norm”. Show
that $f : \sigma^{n-1} \to \sigma^{n-1}$ and use the Brouwer theorem. Remark: This theorem
was originally due to Perron and Frobenius independently (with very different
proofs).
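
To get a feel for the map $f$ in the hint, one can simply iterate it numerically. The sketch below is an illustration only and is not the Brouwer argument: Brouwer's theorem guarantees that a fixed point exists but says nothing about whether iteration finds it. Repeatedly applying $f$ is the classical power method, which does converge for matrices with positive entries. The matrix `A` and the function names are hypothetical choices made for this example.

```python
def normalized_image(A, x):
    """The map f(x) = Ax / |Ax|, with |.| the L1 norm; A is a list of rows."""
    y = [sum(a * xi for a, xi in zip(row, x)) for row in A]
    total = sum(abs(v) for v in y)
    return [v / total for v in y]


def iterate_map(A, steps=200):
    """Apply f repeatedly, starting from the barycenter of the simplex."""
    n = len(A)
    x = [1.0 / n] * n
    for _ in range(steps):
        x = normalized_image(A, x)
    return x


A = [[2.0, 1.0],
     [1.0, 3.0]]  # hypothetical matrix with all entries positive

v = iterate_map(A)
Av = [sum(a * vi for a, vi in zip(row, v)) for row in A]
# Since v has positive entries summing to 1, a fixed point of f satisfies Av = |Av| v,
# so the eigenvalue is the L1 norm of Av.
eigenvalue = sum(Av)

print(v, eigenvalue)
```

Every iterate stays in the simplex (nonnegative entries summing to 1), which is exactly the statement $f : \sigma^{n-1} \to \sigma^{n-1}$ that the exercise asks you to establish.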
Figure 5.13 The Platonic solids.


5.5 Suspensions of spheres Let K and L be simplicial complexes on disjoint 
vertex sets. Their join $K * L$ is the simplicial complex consisting of all simplices
of the form $[S \cup T]$, where $[S] \in K$ and $[T] \in L$. The $n$-fold join $K^{*n}$
is the join $K * K * \cdots * K$ of $n$ copies of $K$, where we imagine the vertex
sets of each copy to be disjoint. Let $X$ be the $n$-fold join $(S^0)^{*n}$, where $S^0$ is
the 0-sphere consisting of two points. In this problem we wish to show that
$X$ may be identified with a simplicial complex arising from the boundary of
a certain convex polytope.

The $n$-dimensional crosspolytope is the convex hull of the points $\{e_1, -e_1,
\dots, e_n, -e_n\}$, where $e_i$ is the $i$th standard basis vector of $\mathbb{R}^n$. For example,
the three-dimensional crosspolytope is just an octahedron. Crosspolytopes
are simplicial, meaning that all their faces are simplices. By the properties of 
polytopes, the collection of all the faces is then a simplicial complex, called 
the boundary complex of the polytope. 

Show that $X$ is isomorphic to the boundary complex of the $n$-dimensional
crosspolytope. Hint: Show that $S \subseteq \{e_1, -e_1, \dots, e_n, -e_n\}$ is the vertex
set of a proper face of the crosspolytope if and only if, for any $i$,
$S$ never contains both $e_i$ and $-e_i$. Remark: It is easy to see that the $n$-dimensional
crosspolytope is homeomorphic to the $n$-dimensional ball $B^n$,
and its boundary complex is homeomorphic to the $(n-1)$-dimensional
sphere $S^{n-1}$. Furthermore, the join of a complex $K$ with $S^0$ is just
the suspension of $K$, so this problem shows why the suspension of $S^k$
is $S^{k+1}$.

5.6 The Platonic solids A polygon is a two-dimensional convex polytope. If 
the angles between neighboring edges are all equal, it is equiangular. If 
the lengths of all edges are equal, it is equilateral. If it is both equiangular 
and equilateral, it is regular. The first few regular polygons are the triangle, 
square, and pentagon. A three-dimensional convex polyhedron is regular, or 
a Platonic solid, if its two-dimensional faces are regular polygons all of the 
same size and if the same number of faces meet at every vertex. In this prob- 
lem we will prove that there are only five Platonic solids (so-named because 
they were well known to the ancient Greeks, especially Plato). They are the 
tetrahedron, cube (hexahedron), octahedron, dodecahedron, and icosahedron.
(See Figure 5.13.) 

We will use the fact that the Euler characteristic is a topological invariant. 
The surface of each polyhedron is topologically equivalent (homeomorphic) 
to a two-dimensional sphere S 2 . Thus, all the polyhedral surfaces have the 
same Euler characteristic. Examining each figure, we see that the Euler characteristic
$\chi = V - E + F$ of each surface is 2, where $V$ is the number
of vertices, E the number of edges, and F the number of two-dimensional 
faces. 

(a) Tabulate the face numbers V, E, and F for each Platonic solid surface, 
and verify that $\chi = 2$ for each one. The face numbers of some of the
figures are related. Can you explain why? What about the face numbers 
of the tetrahedron? 

(b) Let $p$ be the number of edges of each face and $q$ the number of faces
meeting at each vertex. For example, for the octahedron $p = 3$ and
$q = 4$. Label each solid by its Schläfli symbol $\{p, q\}$. Thus, the Platonic
solids are labeled $\{3, 3\}$, $\{4, 3\}$, $\{3, 4\}$, $\{5, 3\}$, $\{3, 5\}$. Using the properties
of the Platonic solids together with Euler’s equation, show that we must
have

$$\frac{1}{p} + \frac{1}{q} > \frac{1}{2}$$

and thereby show that the only possible regular polyhedra are the Platonic
solids listed above. Hint: Relate the face numbers to $p$ and $q$.

5.7 The Euler characteristic of the 2-torus Using the triangulation of 
the two-dimensional torus $T^2$ shown in Figure 5.14, compute its Euler
characteristic $\chi(T^2)$. Hint: Be careful not to overcount the vertices and
edges! Remark 1: This seems like a lot of triangles. Actually, the num- 
ber could be reduced, but only to 14, and then the triangulation would 



Figure 5.14 A triangulation of the torus (with opposing edges identified). 
not look as nice. Why 14? That’s a story for another time. Remark 2: Note that
$\chi(T^2)$ differs from $\chi(S^2)$, thereby showing that the two surfaces are topologically
distinct. We already know this from our cohomology calculations, but
counting simplices is easier, don’t you think? 

5.8 Simplicial homology of a wedge sum Let K be the simplicial complex 
consisting of two copies of the boundary of a 3-simplex glued along a sin- 
gle vertex. In other words, K is the wedge sum of two tetrahedra. (For the 
definition of a wedge sum of topological spaces, see Exercise 4.18.) 

(a) Calculate $\chi(K)$ by counting faces.

(b) Compute the simplicial homology groups of $K$ and verify (5.11). Hint:
You may use the facts, proved for the de Rham cohomology in Exercise 4.18,
that $H_0(K \vee L) \oplus \mathbb{R} = H_0(K) \oplus H_0(L)$ and $H_k(K \vee L) =
H_k(K) \oplus H_k(L)$ for $k > 0$.

5.9 The Euler characteristic of a genus g surface For most spaces X (except 
some pathological counterexamples), the Euler characteristic is additive. 
That is, 


$$\chi(X) = \chi(X \setminus U) + \chi(U), \tag{5.13}$$

where $U$ is a subspace of $X$ and $X \setminus U$ means $X$ with $U$ removed.¹² This
formula often provides a simple way of calculating the Euler characteristic
that avoids triangulations.

(a) Let $U$ and $V$ be subspaces of a space $X$. Use (5.13) to show that the
Euler characteristic is a valuation, namely that it satisfies the property of
inclusion-exclusion:

$$\chi(U \cup V) = \chi(U) + \chi(V) - \chi(U \cap V). \tag{5.14}$$


(b) A two-dimensional torus $T$ has no boundary. (One way to see this is
that the two-dimensional torus is the boundary of the solid torus and the
boundary of a boundary is always zero.) But if you remove an open disk
from somewhere on $T$ you get a surface $T'$ with boundary, namely the
boundary of the disk. If you glue together two copies of $T'$ along
their boundaries, you get the connected sum of the two original tori,
written $T \# T$.

If you now cut out an open disk from the two-holed torus and glue 
another copy of T' along their respective boundaries you get a three- 
holed torus, or T#T#T. Continuing in this way you get a g-holed 

12 It is worth noting that U can be open or closed. 
torus, where $g$ is called the genus of the surface. (The two-dimensional
sphere can be thought of as a 0-holed torus, or a genus-0 surface.)


[Figure: two copies of $T'$ glued along their boundaries to form $T \# T$.]


Show that the Euler characteristic of a genus-$g$ surface is $\chi = 2 - 2g$.
Relate this to the result of Exercise 4.16. 

Remark: A compact manifold without boundary is often said to be closed 
(not to be confused with the topological meaning of “closed” introduced 
in Section 3.1). The classification theorem for closed two-dimensional 
surfaces asserts that any closed connected orientable two-dimensional
surface is homeomorphic to a $g$-holed torus (including the case $g = 0$);
if the surface is nonorientable then it is homeomorphic to the connected 
sum of real projective planes. For an elegant proof of this due to Conway, 
see [26]. 

5.10 Chain maps and homology groups (This exercise is the homological version
of Exercise 4.4.) Let $K$ be a simplicial complex, and let $C_i := C_i(K, \mathbb{F})$
be the space of $i$-chains of $K$ with coefficients in $\mathbb{F}$. Let $\partial_i : C_i \to C_{i-1}$ be
the boundary operator. As $\partial_i \partial_{i+1} = 0$ for all $i$, we get a sequence

$$\cdots \xrightarrow{\ \partial_{i+2}\ } C_{i+1} \xrightarrow{\ \partial_{i+1}\ } C_i \xrightarrow{\ \partial_i\ } C_{i-1} \xrightarrow{\ \partial_{i-1}\ } \cdots$$

For this reason, any sequence of vector spaces $C_i$ with boundary maps $\partial_i$ satisfying
this condition is called a complex regardless of whether the $C_i$ arise from some simplicial
complex. We adopt all the previous terminology, so that $Z_i(C) := \ker\partial_i$ is
the group of $i$-cycles, $B_i(C) := \operatorname{im}\partial_{i+1}$ is the group of $i$-boundaries, and
$H_i(C) := Z_i(C)/B_i(C)$ is the $i$th homology group of the complex $C$.

Suppose now that we have two complexes, $(A_i, \partial_i)$ and $(B_i, \partial_i)$. A chain
map $\psi : (A_i, \partial_i) \to (B_i, \partial_i)$ is a sequence of linear maps $\psi_i : A_i \to B_i$ that commute with
the boundary maps. More precisely, we have

$$\psi_{i-1}\, \partial_i = \partial_i\, \psi_i \tag{5.15}$$

for all i . A pair of complexes related in this way by a chain map is called a 
homomorphism of complexes and is elegantly represented by the following 
commutative diagram: 


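$$\begin{array}{ccc}
\vdots & & \vdots \\
\downarrow & & \downarrow \\
A_{i+1} & \xrightarrow{\ \psi_{i+1}\ } & B_{i+1} \\
{\scriptstyle \partial_{i+1}} \downarrow & & \downarrow {\scriptstyle \partial_{i+1}} \\
A_i & \xrightarrow{\ \ \psi_i\ \ } & B_i \\
{\scriptstyle \partial_i} \downarrow & & \downarrow {\scriptstyle \partial_i} \\
A_{i-1} & \xrightarrow{\ \psi_{i-1}\ } & B_{i-1} \\
\downarrow & & \downarrow \\
\vdots & & \vdots
\end{array} \tag{5.16}$$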


The statement that the diagram is commutative is precisely the statement that 
the maps $\psi$ commute with the boundary maps $\partial$.

Show that a homomorphism of complexes gives rise to a homomorphism 
of homology groups. That is, show that there is a natural linear map $h_i :
H_i(A) \to H_i(B)$ for all $i$. Hint: Let $[a_i]$ be an element of $H_i(A)$, and consider
$[\psi_i(a_i)]$. Show the map is well defined, i.e., that it maps homology classes to
homology classes and is independent of class representative. 

5.11 Mayer-Vietoris for homology and the Euler characteristic As with
cohomology, a short exact sequence of complexes gives rise to a long exact
sequence of homology groups. In particular, there is a homological analogue
of the Mayer-Vietoris sequence for manifolds, namely¹³

$$\cdots \longrightarrow H_i(U \cap V) \longrightarrow H_i(U) \oplus H_i(V) \longrightarrow H_i(U \cup V) \longrightarrow H_{i-1}(U \cap V) \longrightarrow \cdots$$

13 Precisely what the maps are does not concern us here. For more details, consult [37].

Use this to obtain another proof of (5.14) (for $U$ and $V$ open). Hint: You may
want to refer to Exercise 1.30.

5.12 The Hopf trace formula In Exercise 5.10 you showed that a chain map 
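$\psi : (A_i, \partial_i) \to (B_i, \partial_i)$ between two complexes induces a natural linear map
$h_i : H_i(A) \to H_i(B)$ between the homology groups of those complexes. Now
suppose that $A = B = C$. Then $\psi_i : C_i \to C_i$ and $h_i : H_i(C) \to H_i(C)$
become linear endomorphisms. As such, they each have a well-defined trace.
Assuming that $C_i = 0$ for $i > n$, show that

$$\sum_{i=0}^{n} (-1)^i \operatorname{tr} \psi_i = \sum_{i=0}^{n} (-1)^i \operatorname{tr} h_i. \tag{5.17}$$

Hint: Show that there are short exact sequences

$$0 \longrightarrow Z_i \xrightarrow{\ \iota\ } C_i \xrightarrow{\ \partial_i\ } B_{i-1} \longrightarrow 0$$

and

$$0 \longrightarrow B_i \xrightarrow{\ \iota\ } Z_i \xrightarrow{\ \pi_i\ } H_i \longrightarrow 0,$$

where $Z_i = Z_i(C) = \ker\partial_i$, $B_i = B_i(C) = \operatorname{im}\partial_{i+1}$, $\iota$ is the appropriate
inclusion map, $\partial_i$ is the boundary operator, and $\pi_i$ is the natural projection
onto the quotient. Recall that every exact sequence of vector spaces

$$0 \longrightarrow U \xrightarrow{\ \iota\ } V \xrightarrow{\ \pi\ } W \longrightarrow 0$$

splits, so there exists a section $s : W \to V$ with $\pi \circ s = 1$. In particular, we
have $V = U \oplus s(W)$. Thus, if $\psi$ is a linear operator on $V$ then $\operatorname{tr}(\psi, V) =
\operatorname{tr}(\psi, U) + \operatorname{tr}(\psi, s(W))$, where $\operatorname{tr}(\psi, S)$ means the trace of $\psi$ restricted to
the subspace $S$. Now relate $\operatorname{tr}(\psi_i, s(B_{i-1}))$ to $\operatorname{tr}(\psi_{i-1}, B_{i-1})$ and $\operatorname{tr}(\psi_i, s(H_i))$ to
$\operatorname{tr}(h_i, H_i)$.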

Remark: A smooth map $f : M \to M$ from a manifold to itself naturally
induces a map $f_* : C \to C$ (see Section 6.3) of chain complexes of the
manifold. In that case, the right-hand side of (5.17) is called the Lefschetz
number of the map $f$. Note that if $f$ is the identity map and $M$ is triangulable
then the Lefschetz number is just the Euler characteristic and the
Hopf trace formula reduces to Theorem 5.2. The remarkable Lefschetz fixed
point theorem asserts that if $M$ is compact and orientable and the Lefschetz
number does not vanish then $f$ has a fixed point, namely a point $p \in M$
with $f(p) = p$. In fact, the Lefschetz fixed point theorem is a special case
of a more general theorem known as the Lefschetz-Hopf theorem. See the 
discussion following Theorem 9.4. 
. . . Stokes’ Theorem has had a curious history and has
undergone a striking metamorphosis.

Michael Spivak†

After that quick jaunt into general topology we now return to the setting of smooth 
manifolds. Our aim in this chapter is to relate singular homology to de Rham coho- 
mology. It will turn out that they are isomorphic, but to establish this we must first 
introduce a few more concepts. The key tool is integration, and we will derive some 
of its fundamental properties, including both the change of variables theorem and 
Stokes’ theorem. 


6.1 Smooth singular homology 

Simplices (and singular simplices) tend to have sharp corners where derivatives 
do not exist, whereas when dealing with a smooth manifold M we would like 
everything to behave nicely with respect to smooth maps. Ideally we want the map 
$\phi : \bar\sigma^n \to M$ to be smooth. But smooth maps are defined only on open sets,
and our simplices are closed. The way around this is to extend $\phi$ to a smooth
map. More precisely, a smooth singular $m$-simplex $\sigma$ is an equivalence class of
triples $(\bar\sigma, U, \phi)$, where $\bar\sigma$ is an $m$-simplex, $U$ is an open neighborhood of $\bar\sigma$ in $\operatorname{Aff}\bar\sigma$
(the smallest affine subspace containing $\bar\sigma$), and $\phi : U \to M$ is a smooth map.
Two triples $(\bar\sigma, U, \phi)$ and $(\bar\tau, V, \psi)$ are in the same class if $\phi(\bar\sigma) = \psi(\bar\tau)$ or,
equivalently, if $\phi = \psi \circ f$ where $f : \bar\sigma \to \bar\tau$ is the natural order-preserving
affine map.

† Excerpt from [79], p. viii. Reprinted by permission.

As before, we define $C_k^\infty(M)$, the space of smooth singular $k$-chains in $M$,
to be the vector space generated by all the smooth singular $k$-simplices in $M$. If
$\sigma = (\bar\sigma, U, \phi)$ and $\partial\bar\sigma = \sum_i \pm\bar\sigma_i$, the boundary of $\sigma$ is defined to be
$\partial\sigma := \sum_i \pm\sigma_i$, where $\sigma_i = (\bar\sigma_i, V_i, \phi)$ and $V_i$ is a small open neighborhood of $\bar\sigma_i$. The
boundary operator again satisfies $\partial^2 = 0$, so the space $B_k^\infty(M)$ of smooth singular
$k$-boundaries of $M$ is a subspace of the space $Z_k^\infty(M)$ of smooth singular $k$-cycles
in $M$ and we can define the $k$th smooth singular homology of $M$ to be the
quotient space

$$H_k^\infty(M) = Z_k^\infty(M)/B_k^\infty(M). \tag{6.1}$$

It is a happy fact that, for smooth manifolds, the smooth singular homology and 
the singular homology agree.¹

Theorem 6.1 For a smooth manifold M, 

$$H_k^\infty(M) = H_k(M). \tag{6.2}$$

(For this reason, one often omits the superscript $\infty$ in the smooth setting.) One
way to prove this result is to exploit the fact that every smooth manifold admits
a smooth triangulation.² More precisely, a smoothly triangulated manifold is a
triple $(M, K, f)$, where $M$ is a smooth manifold, $K$ is a simplicial complex, and
$f : K \to M$ is a homeomorphism satisfying the following property. For each
simplex $\sigma \in K$, the map $f : \sigma \to M$ can be extended to a neighborhood $U$ of $\sigma$ in
$\operatorname{Aff}\sigma$ in such a way that $f : U \to M$ is an embedding. As a smooth triangulation
is also a triangulation, we obtain a smooth analogue of the continuous result that 
the homology of M coincides with the homology of the triangulation K . 

Theorem 6.2 If K is a smooth triangulation of M then 

$$H_k^\infty(M) = H_k(K).$$

The next step is to relate $H_k^\infty(M)$ to the de Rham cohomology. Before we can do
this, however, we must first talk about integration on manifolds. 


6.2 Integration on chains 

First we define the integration of forms on a domain $D \subseteq \mathbb{R}^n$ in the obvious way. If

$$\omega = A(x^1, \dots, x^n)\, dx^1 \wedge \cdots \wedge dx^n \tag{6.3}$$


1 See e.g. ([51], Theorem 16.6). 

2 This result is due to Cairns [13] and to Whitehead [90].
then we define

$$\int_D \omega := \int_D A(x^1, \dots, x^n)\, dx^1 \cdots dx^n, \tag{6.4}$$

where the integral on the right is the ordinary Riemann integral (which has no
wedge products!).³

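Example 6.1 If $\omega = 3\, dy \wedge dx \wedge dz$ and if $s^n = (0, e_1, e_2, \dots, e_n)$ is the standard
simplex ($\{e_i\}$ being the standard basis of $\mathbb{R}^n$) then

$$\int_{s^3} \omega = -3 \int_{s^3} dx\, dy\, dz = -3 \int_0^1 dx \int_0^{1-x} dy \int_0^{1-x-y} dz = -\frac{1}{2}.$$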
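
The minus sign in Example 6.1 comes entirely from reordering $dy \wedge dx \wedge dz$ into the standard order $dx \wedge dy \wedge dz$ before applying (6.4). The following small sketch makes that bookkeeping explicit for constant-coefficient top-degree forms on the standard simplex. It relies on the standard fact that the standard $n$-simplex has volume $1/n!$; the function names are invented for this illustration.

```python
from fractions import Fraction
from math import factorial


def permutation_sign(perm):
    """Sign of a permutation of 0..n-1, computed by counting inversions."""
    inversions = sum(
        1
        for i in range(len(perm))
        for j in range(i + 1, len(perm))
        if perm[i] > perm[j]
    )
    return -1 if inversions % 2 else 1


def integrate_constant_top_form(coefficient, order):
    """Integrate coefficient * dx^{order[0]} ^ ... ^ dx^{order[n-1]} over the standard n-simplex.

    `order` lists the coordinate indices (0-based) in the order they appear in the wedge product.
    """
    n = len(order)
    volume = Fraction(1, factorial(n))  # volume of the standard n-simplex
    return coefficient * permutation_sign(order) * volume


# 3 dy ^ dx ^ dz with coordinates (x, y, z) = (0, 1, 2)
result = integrate_constant_top_form(3, (1, 0, 2))
print(result)  # -1/2
```

Swapping any two of the differentials flips the sign of the answer, which is what it means for the integral to be an oriented integral.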



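Let $M$ be a smooth manifold, $\omega$ a $k$-form on $M$, and $c$ a smooth singular $k$-chain
on $M$, where $k \le n$. Then the expression

$$\int_c \omega$$

is defined by pulling it back to Euclidean space, where we know the answer. We
want it to be linear on chains, so if

$$c = \sum_i a_i \sigma_i \tag{6.5}$$

then we set

$$\int_c \omega := \sum_i a_i \int_{\sigma_i} \omega. \tag{6.6}$$

Integration over simplices is carried out via pullback. If $\sigma = (\bar\sigma^k, U, \phi)$ then we set

$$\int_\sigma \omega := \int_{\bar\sigma^k} \phi^*\omega. \tag{6.7}$$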

In order for this to be well defined, it must be independent of the equivalence class 
representative used to define it. This requirement leads us to the general change of 
variables theorem. 


6.3 Change of variables 

Let $f : M \to N$ be a smooth map of smooth manifolds. Then there is a natural
induced linear map

$$f_* : C_k(M) \to C_k(N) \tag{6.8}$$


3 Technically, we are defining our integrals to be oriented integrals, because they depend on the order of the 
variables in the differential forms. This is really the natural definition, as will become evident in the next 
section. The reader should be warned that this convention is by no means universal, although it ought to be. 
Figure 6.1 The induced map $f_*$ on simplices.


taking $k$-chains on $M$ to $k$-chains on $N$.⁴ The map $f_*$ is defined to be the linear
extension of the map sending the simplex $\sigma$ on $M$, represented by $(\bar\sigma, U, \phi)$, to the
simplex $f_*\sigma$ on $N$, represented by $(\bar\sigma, U, f \circ \phi)$. (See Figure 6.1.) In passing we
observe that, for any chain $c$,

$$f_*(\partial c) = \partial(f_* c), \tag{6.9}$$

while for smooth maps $f : M \to N$ and $g : N \to P$ we have

$$(g \circ f)_* = g_* \circ f_*. \tag{6.10}$$

Return now to (6.7), and suppose that $\sigma$ is also represented by the triple
$(\bar\tau^k, V, \psi)$; let $f : \bar\sigma^k \to \bar\tau^k$ be the unique order-preserving affine map of simplices,
with $\phi = \psi \circ f$. Then the assumed well-definedness of (6.7) is equivalent
to the chain of equalities

$$\int_{f_*\bar\sigma^k} \psi^*\omega = \int_{\bar\tau^k} \psi^*\omega = \int_\sigma \omega = \int_{\bar\sigma^k} \phi^*\omega = \int_{\bar\sigma^k} f^*\psi^*\omega \tag{6.11}$$

4 There is no danger of confusing this map with the pushforward map defined in Section 3.15, even though they 
both use the same star subscript notation, because the former is a map of chains which live on the manifold 
itself while the latter is a map of tangent spaces, which do not. (In any case, the notation is now more or less 
standard.) 
(where we have used the fact that $f = f_*$ in $\mathbb{R}^n$). Thus, to show that (6.7) is well
defined we must show that the first and last terms in (6.11) are equal. But this
follows from the usual change of variables theorem in Euclidean space.


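Actually, something a bit more general holds. Let $f : \mathbb{R}^n \to \mathbb{R}^n$ be a
diffeomorphism. Choose local coordinates, so that $f$ is given by

$$y^i = f^i(x^1, \dots, x^n), \qquad i = 1, \dots, n, \tag{6.12}$$

and let $\eta = a(y)\, dy^1 \wedge \cdots \wedge dy^n$. Let $\sigma^n$ be an $n$-simplex. Then, by the usual
change of variables theorem from calculus (and with the assistance of (3.82) and
the definition (6.4)), we have⁵

$$\int_{f_*\sigma^n} \eta = \int_{f_*\sigma^n} a(y)\, dy^1 \cdots dy^n
= \int_{\sigma^n} a(f(x)) \det\!\left(\frac{\partial y^i}{\partial x^j}\right) dx^1 \cdots dx^n
= \int_{\sigma^n} f^*\eta. \tag{6.13}$$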


The same argument proves (6.11) on setting $\eta := \psi^*\omega$ and restricting $f$ to $\operatorname{Aff}\bar\sigma^k$.
Equation (6.13) reveals a natural duality between $f_*$ and $f^*$ with respect to
integration. This duality extends naturally to chains on arbitrary manifolds and 
thereby yields the following elegant formulation of the general change of variables 
theorem. 

Theorem 6.3 (Change of variables) Let $f : M \to N$ be a diffeomorphism, $\omega$ a
$k$-form on $N$, and $c$ a $k$-chain on $M$. Then

$$\int_{f_* c} \omega = \int_c f^*\omega. \tag{6.14}$$

Example 6.2 The change of variables theorem can also be used to recover the
classical expressions for line and surface integrals. Let $\omega = A_1\, dx^1 + A_2\, dx^2 +
A_3\, dx^3$ be a 1-form on $\mathbb{R}^3$, and let $\gamma$ be a curve. In our new terminology, a curve is
just a (singular) 1-chain in $\mathbb{R}^3$. More precisely, if $I$ is the 1-simplex in $\mathbb{R}$, the 1-chain
$\gamma$ is the image of $I$ under the map $\gamma$ given by

$$\gamma(t) = (\gamma^1(t), \gamma^2(t), \gamma^3(t)),$$


5 The astute reader will notice that we have cheated here. The usual change of variables theorem employs the
absolute value of the Jacobian, whereas we have used the Jacobian itself. But that’s all right, because we
defined our integrals in (6.4) precisely to account for this pesky sign.


6.4 Stokes’ theorem 
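where $t$ is the coordinate on $\mathbb{R}$. We have

$$\begin{aligned}
\gamma^*\omega &= \gamma^*(A_1\, dx^1 + A_2\, dx^2 + A_3\, dx^3) \\
&= (\gamma^* A_1)(\gamma^* dx^1) + (\gamma^* A_2)(\gamma^* dx^2) + (\gamma^* A_3)(\gamma^* dx^3) \\
&= (A_1 \circ \gamma)\, d\gamma^1 + \cdots + (A_3 \circ \gamma)\, d\gamma^3 \\
&= \mathbf{A}(\gamma(t)) \cdot d\gamma \\
&= (\mathbf{A}(\gamma(t)) \cdot \nabla\gamma)\, dt,
\end{aligned}$$

where $\mathbf{A}$ is the vector field whose components are $(A_1, A_2, A_3)$, and $\nabla$ is the usual
gradient operator (here $\nabla\gamma$ is simply $d\gamma/dt$, because $\gamma$ depends on the single
variable $t$). (The dot product of two vectors is their Euclidean inner product.)
Applying Theorem 6.3 gives

$$\int_\gamma \omega = \int_I \gamma^*\omega = \int_I \mathbf{A}(\gamma(t)) \cdot d\gamma,$$

namely the usual line integral of $\mathbf{A}$ along the curve $\gamma$.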
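
The pullback recipe of Example 6.2 translates directly into a numerical procedure: evaluate the field along the curve, dot it with the velocity, and integrate over the parameter. The sketch below is a hypothetical illustration, not taken from the text. It uses the 1-form $\omega = -y\, dx + x\, dy$ on the unit circle in the plane $z = 0$, parametrized over $[0, 2\pi]$ rather than over a standard 1-simplex, and a plain midpoint rule; the function name `line_integral` is invented here.

```python
import math


def line_integral(A, gamma, dgamma, t0, t1, steps=20000):
    """Midpoint-rule approximation of the integral of A(gamma(t)) . gamma'(t) dt over [t0, t1]."""
    h = (t1 - t0) / steps
    total = 0.0
    for i in range(steps):
        t = t0 + (i + 0.5) * h
        field = A(*gamma(t))   # components (A_1, A_2, A_3) evaluated on the curve
        velocity = dgamma(t)   # components of d(gamma)/dt
        total += sum(a * v for a, v in zip(field, velocity)) * h
    return total


def A(x, y, z):
    """Components of the hypothetical 1-form omega = -y dx + x dy."""
    return (-y, x, 0.0)


def gamma(t):
    """Unit circle in the plane z = 0."""
    return (math.cos(t), math.sin(t), 0.0)


def dgamma(t):
    """Velocity of the unit circle parametrization."""
    return (-math.sin(t), math.cos(t), 0.0)


value = line_integral(A, gamma, dgamma, 0.0, 2.0 * math.pi)
print(value, 2.0 * math.pi)
```

For this choice the pulled-back form is $\gamma^*\omega = (\sin^2 t + \cos^2 t)\, dt = dt$, so the integral is $2\pi$ and the numerical value should agree with it up to rounding.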

EXERCISE 6.1 Let $\gamma(t) = (t, t^2, t^3)$, $0 \le t \le 2$, be a curve in $\mathbb{R}^3$. (It is called the
moment curve.) Let $\omega = yz\, dx + zx\, dy + xy\, dz$ be a 1-form on $\mathbb{R}^3$. Evaluate $\int_\gamma \omega$.

EXERCISE 6.2 Let $\beta = B_x\, dy \wedge dz + B_y\, dz \wedge dx + B_z\, dx \wedge dy$ be a 2-form in $\mathbb{R}^3$.
Let $\Sigma$ be a surface in $\mathbb{R}^3$, namely the image under a map $\varphi$ of a 2-chain $c$ in $\mathbb{R}^2$. By
choosing coordinates $u, v$ on $\mathbb{R}^2$ such that $\Sigma$ is given by

$$(u, v) \mapsto (\varphi^1(u, v), \varphi^2(u, v), \varphi^3(u, v)), \tag{6.15}$$


show that 



the ordinary flux integral of $\mathbf{B}$ over $\Sigma$. Here $\mathbf{B} = (B_x, B_y, B_z)$, $dS$ is the surface
area element of $\Sigma$, and $\mathbf{n} = \partial\varphi/\partial u \times \partial\varphi/\partial v$ is the surface normal to $\Sigma$ (an ordinary
cross product).
The fundamental theorem of calculus states that

$$\int_a^b f'(x)\, dx = f(b) - f(a). \tag{6.17}$$


We can view the interval $[a, b]$ as an oriented chain with oriented boundary $b - a$,
and write (6.17) more suggestively as

$$\int_{[a,b]} df = \int_{\partial[a,b]} f, \tag{6.18}$$

where the integral of a function over a point just means the evaluation of the function
at the point (with the appropriate sign). In this way the fundamental theorem
of calculus can be interpreted as saying that the integral of the differential of a
function $f$ over a chain equals the integral of $f$ over the boundary of the chain
(with the appropriate orientations).

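In multivariable calculus the fundamental theorem is extended from $\mathbb{R}$ to $\mathbb{R}^3$
in the theorems of Stokes and Gauss. So, if $\mathbf{A}$ is a vector field, $\Sigma$ a surface with
boundary $\partial\Sigma$, and $V$ a volume with boundary $\partial V$, then we have

\[
\int_\Sigma (\nabla\times\mathbf{A})\cdot d\mathbf{S} = \oint_{\partial\Sigma} \mathbf{A}\cdot d\mathbf{l} \tag{6.19}
\]

and

\[
\int_V (\nabla\cdot\mathbf{A})\,dV = \oint_{\partial V} \mathbf{A}\cdot d\mathbf{S}. \tag{6.20}
\]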


In both these theorems one must choose compatible orientations on the domains of 
integration and their respective boundaries (via the “outward pointing normal”). 
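
As a quick sanity check of the theorem of Gauss, one can compare the two sides numerically on the unit cube. This is a sketch only: the field `A` and its hand-computed divergence `div_A` are illustrative assumptions, and the outward normals are put in by hand face by face.

```python
def divergence_integral(div_A, n=40):
    """Midpoint-rule integral of div A over the unit cube [0, 1]^3."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            for k in range(n):
                total += div_A((i + 0.5) * h, (j + 0.5) * h, (k + 0.5) * h) * h ** 3
    return total

def outward_flux(A, n=40):
    """Flux of A through the six faces of the unit cube, using outward normals."""
    h = 1.0 / n
    total = 0.0
    for i in range(n):
        for j in range(n):
            s, t = (i + 0.5) * h, (j + 0.5) * h
            total += (A(1.0, s, t)[0] - A(0.0, s, t)[0]) * h * h  # faces x = 1, x = 0
            total += (A(s, 1.0, t)[1] - A(s, 0.0, t)[1]) * h * h  # faces y = 1, y = 0
            total += (A(s, t, 1.0)[2] - A(s, t, 0.0)[2]) * h * h  # faces z = 1, z = 0
    return total

def A(x, y, z):
    """Illustrative vector field (an assumption, not from the text)."""
    return (x * x * y, y * z, z * z)

def div_A(x, y, z):
    """Divergence of the field above, computed by hand."""
    return 2 * x * y + z + 2 * z

volume_side = divergence_integral(div_A)
surface_side = outward_flux(A)
print(volume_side, surface_side)
```

Both numbers should equal 2 for this field; the minus signs on the faces $x = 0$, $y = 0$, $z = 0$ are exactly the orientation bookkeeping mentioned above.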

All these formulae are just special cases of a far-reaching theorem of integration 
on manifolds, called simply Stokes’ theorem. Actually, there are two approaches 
to Stokes’ theorem, which are essentially equivalent. For completeness, we discuss 
both of them here. 


6.4.1 Stokes’ theorem I

Theorem 6.4 (Stokes’ theorem - chain version) For any $k$-form $\omega$ and $(k+1)$-chain $c$,

\[
\int_c d\omega = \int_{\partial c} \omega. \tag{6.21}
\]


Remark In the next section we show how Stokes’ theorem also works for
“infinite chains” (i.e., noncompact manifolds with boundary), provided we restrict
ourselves to differential forms with compact support. (The support of a form $\omega$,
denoted $\operatorname{supp}\omega$, is the smallest closed set that contains the set of points at which $\omega$
does not vanish.)


Proof By linearity it suffices to prove that

\[
\int_\sigma d\omega = \int_{\partial\sigma} \omega
\]

for a single $(k+1)$-simplex $\sigma$. If $\sigma = (s^{k+1}, U, \phi)$ then this is equivalent to

\[
\int_{s^{k+1}} d(\phi^*\omega) = \int_{s^{k+1}} \phi^* d\omega = \int_\sigma d\omega = \int_{\partial\sigma} \omega = \int_{\partial s^{k+1}} \phi^*\omega.
\]

It is therefore sufficient to prove that

\[
\int_{s^{k+1}} d\lambda = \int_{\partial s^{k+1}} \lambda \tag{6.22}
\]

for $\lambda$ a $k$-form on a neighborhood $U$ of the standard $(k+1)$-simplex $s^{k+1}$ in $\mathbb{R}^{k+1}$.
Again by linearity it is enough to prove (6.22) for a monomial of the form

\[
\lambda = A(x^1, \dots, x^{k+1})\,dx^1\wedge\cdots\wedge dx^k.
\]


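We have

\[
d\lambda = (-1)^k\,\frac{\partial A}{\partial x^{k+1}}\,dx^1\wedge\cdots\wedge dx^{k+1},
\]

so that

\[
\begin{aligned}
\int_{s^{k+1}} d\lambda
&= (-1)^k \int_{s^{k+1}} \frac{\partial A}{\partial x^{k+1}}\,dx^1\wedge\cdots\wedge dx^{k+1} \\
&= (-1)^k \int_{x^i\ge 0,\ \sum_{i=1}^k x^i\le 1} dx^1\cdots dx^k
   \left(\int_0^{1-\sum_{i=1}^k x^i} \frac{\partial A}{\partial x^{k+1}}\,dx^{k+1}\right) \\
&= (-1)^k \int_{x^i\ge 0,\ \sum_{i=1}^k x^i\le 1} dx^1\cdots dx^k
   \left[A\Big(x^1,\dots,x^k,\,1-\sum_{i=1}^k x^i\Big) - A(x^1,\dots,x^k,0)\right].
\end{aligned}
\tag{6.23}
\]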


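Furthermore,

\[
\begin{aligned}
\partial s^{k+1} &= \partial\langle e_0, e_1, \dots, e_{k+1}\rangle \\
&= \langle e_1, e_2, \dots, e_{k+1}\rangle + (-1)^{k+1}\langle e_0, e_1, \dots, e_k\rangle \pm \text{other faces},
\end{aligned}
\]

giving

\[
\int_{\partial s^{k+1}} \lambda = \int_{\langle e_1,\dots,e_{k+1}\rangle} \lambda + (-1)^{k+1}\int_{\langle e_0,\dots,e_k\rangle} \lambda. \tag{6.24}
\]

(The form $\lambda$ vanishes on the “other faces”, because on those faces at least one of
$x^1, \dots, x^k$ is constant.) But $\langle e_0, e_1, \dots, e_k\rangle$ is just the standard simplex $s^k$ with
$x^{k+1} = 0$, so the second term in (6.24) becomes

\[
(-1)^{k+1}\int_{s^k} A(x^1, \dots, x^k, 0)\,dx^1\cdots dx^k. \tag{6.25}
\]

By projecting down along the $x^{k+1}$ axis, the first term in (6.24) becomes

\[
\begin{aligned}
\int_{\langle e_1,\dots,e_{k+1}\rangle} \lambda
&= \int_{\langle e_1,\dots,e_k,e_0\rangle} A\Big(x^1,\dots,x^k,\,1-\sum_{i=1}^k x^i\Big)\,dx^1\cdots dx^k \\
&= (-1)^k \int_{\langle e_0,\dots,e_k\rangle} A\Big(x^1,\dots,x^k,\,1-\sum_{i=1}^k x^i\Big)\,dx^1\cdots dx^k.
\end{aligned}
\tag{6.26}
\]

Now compare (6.23) with (6.24), (6.25), and (6.26) to obtain (6.22). □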
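
The bookkeeping of signs in this proof can be checked by hand in the lowest nontrivial case $k = 1$, where $\lambda = A(x, y)\,dx$ on the standard 2-simplex. The sketch below is illustrative only: the function `A` and its partial derivative `dA_dy` are our own assumed choices, and the boundary is taken as $\langle e_1, e_2\rangle - \langle e_0, e_2\rangle + \langle e_0, e_1\rangle$.

```python
def interior_integral(dA_dlast, n=200, m=50):
    """Integral of d(lambda) over the standard 2-simplex for lambda = A(x, y) dx.
    Here k = 1, so d(lambda) = (-1)^1 (dA/dy) dx ^ dy."""
    hx = 1.0 / n
    total = 0.0
    for i in range(n):
        x = (i + 0.5) * hx
        hy = (1.0 - x) / m                      # y runs from 0 to 1 - x
        inner = sum(dA_dlast(x, (j + 0.5) * hy) for j in range(m)) * hy
        total += inner * hx
    return -total

def edge_integral(A, start, end, n=200):
    """Integral of lambda = A dx along the oriented straight edge from start to end."""
    dx = end[0] - start[0]
    total = 0.0
    for i in range(n):
        t = (i + 0.5) / n
        x = start[0] + t * (end[0] - start[0])
        y = start[1] + t * (end[1] - start[1])
        total += A(x, y) * dx / n
    return total

def A(x, y):
    """Illustrative coefficient function (an assumption, not from the text)."""
    return x * y + y * y

def dA_dy(x, y):
    """Partial derivative of A with respect to y, computed by hand."""
    return x + 2 * y

e0, e1, e2 = (0.0, 0.0), (1.0, 0.0), (0.0, 1.0)
interior = interior_integral(dA_dy)
boundary = edge_integral(A, e1, e2) - edge_integral(A, e0, e2) + edge_integral(A, e0, e1)
print(interior, boundary)
```

The edge $\langle e_0, e_2\rangle$ contributes nothing because $x$ is constant there, which is the $k = 1$ instance of the vanishing on the “other faces”.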


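Example 6.3 Let $\omega = A_1\,dx^1 + A_2\,dx^2 + A_3\,dx^3$ be a 1-form and let $\Sigma$ be a
closed surface in $\mathbb{R}^3$. Then, using the results of Example 6.2 and Exercise 6.2, Stokes’
theorem

\[
\int_\Sigma d\omega = \int_{\partial\Sigma} \omega
\]

becomes

\[
\int_\Sigma (\nabla\times\mathbf{A})\cdot d\mathbf{S} = \int_{\partial\Sigma} \mathbf{A}\cdot d\mathbf{l}.
\]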
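
The classical form of Stokes’ theorem obtained in Example 6.3 can be tested numerically. In the sketch below everything specific is an assumption made for illustration: the field $\mathbf{A} = (-y, x, 0)$, whose curl is $(0, 0, 2)$, and the upper unit hemisphere with its normal and boundary circle worked out by hand.

```python
import math

def curl_flux(n=200):
    """Flux of curl A = (0, 0, 2) through the upper unit hemisphere,
    parameterized by phi(u, v) = (sin u cos v, sin u sin v, cos u)."""
    du, dv = (math.pi / 2) / n, (2 * math.pi) / n
    total = 0.0
    for i in range(n):
        u = (i + 0.5) * du
        for j in range(n):
            v = (j + 0.5) * dv
            # n = d(phi)/du x d(phi)/dv, computed by hand for this chart.
            normal = (math.sin(u) ** 2 * math.cos(v),
                      math.sin(u) ** 2 * math.sin(v),
                      math.sin(u) * math.cos(u))
            curl_A = (0.0, 0.0, 2.0)
            total += sum(c * m for c, m in zip(curl_A, normal)) * du * dv
    return total

def circulation(n=2000):
    """Line integral of A = (-y, x, 0) around the boundary circle u = pi/2,
    traversed counterclockwise (the orientation compatible with the normal above)."""
    dt = 2 * math.pi / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        x, y = math.cos(t), math.sin(t)
        dx, dy = -math.sin(t) * dt, math.cos(t) * dt
        total += -y * dx + x * dy
    return total

surface_side = curl_flux()
boundary_side = circulation()
print(surface_side, boundary_side)
```

Both sides should come out close to $2\pi$; reversing the direction of traversal of the circle would flip the sign of only one side, which is why compatible orientations matter.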


EXERCISE 6.3 Let $\omega = -yz\,dx + xz\,dy$ be a 1-form in $\mathbb{R}^3$, and let $U$ be an open
subset of the $(u, v)$ plane given by $1 < u < 2$ and $0 < v < 2\pi$. Let $\Sigma := \varphi(U)$ be a
parameterized surface (2-chain) in $\mathbb{R}^3$, where

\[
\varphi(u, v) = (u^3\cos v,\ u^3\sin v,\ u^2).
\]

Using the change of variables theorem, verify that Stokes’ theorem holds, i.e.,

\[
\int_\Sigma d\omega = \int_{\partial\Sigma} \omega.
\]



EXERCISE 6.4 Show that the integral of a closed form over the boundary of any 
chain is zero. Show that the integral of an exact form over a boundaryless chain is 
zero. 


EXERCISE 6.5 Using Stokes’ theorem, show that the property $d^2 = 0$ (mixed partials
commute) is equivalent to the property $\partial^2 = 0$ (the boundary of a boundary
vanishes).
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6.4.2 Stokes’ theorem II 


The second version of Stokes’ theorem is perhaps more familiar to modern readers. 
It is a statement about the integral of a differential form over an oriented mani- 
fold with boundary, so it looks a little different from the chain version. However, 
the distinction is illusory, as one can obtain the second version from the first (see 
e.g. [88], Section 4.8). But the second version can also be derived ab initio without 
recourse to the machinery of chains, which is what we do here. First we need some 
preparation. 

A (smooth) $n$-dimensional manifold $M$ with boundary is defined much like
an ordinary smooth manifold, except that the coordinate patches $\{U_i\}$ belonging to
an atlas $\{U_i, \varphi_i\}$ now come in two varieties: either $U_i$ is homeomorphic to $\mathbb{R}^n$, as
before, or else $U_i$ is homeomorphic to the upper half space $\mathbb{H}^n = \{(x^1, \dots, x^n) :
x^n \ge 0\}$.⁶ ⁷ The boundary $\partial\mathbb{H}^n$ of $\mathbb{H}^n$ is the set $\{(x^1, \dots, x^n) : x^n = 0\}$, which is
homeomorphic to $\mathbb{R}^{n-1}$. The boundary $\partial M$ of $M$ consists of all the points $p \in M$
such that $\varphi_i(p) \in \partial\mathbb{H}^n$. It is an embedded $(n-1)$-dimensional submanifold of
$M$, and it inherits an orientation from that of $M$ as follows. Let $\mathbb{H}^n$ be given the
standard orientation $dx^1\wedge\cdots\wedge dx^n$. Then by definition the induced orientation
of $\partial\mathbb{H}^n$ is the orientation class of $\lambda = (-1)^n\,dx^1\wedge\cdots\wedge dx^{n-1}$. If $\eta$ is a nowhere
vanishing $(n-1)$-form on $\partial M$ then $\partial M$ has the induced orientation if $\varphi_i^*\lambda$ and $\eta$
(restricted to $U_i \cap \partial M$) are in the same orientation class.

Let $\omega$ be a top-dimensional form on an oriented manifold $M$ and let $\{\rho_i\}$ be a
partition of unity subordinate to the open cover $\{U_i\}$. Then we define

\[
\int_M \omega := \sum_i \int_{U_i} \rho_i\,\omega. \tag{6.27}
\]

That is, we break up the integral into little integrals over each coordinate neighborhood
$U_i$, integrate the fraction of $\omega$ belonging to each $U_i$ according to the partition
of unity, then sum all the contributions.
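
The recipe (6.27) can be tried out on the circle, which needs at least two charts. The following sketch rests on several assumptions of our own: the charts $U_1 = (0, 2\pi)$ and $U_2 = (-\pi, \pi)$ in the angle coordinate, a bump function `rho1` built from $e^{-1/t}$, and the sample 1-form $\omega = (1 + \cos^2\theta)\,d\theta$.

```python
import math

def smooth_step(t):
    """Smooth function equal to 0 for t <= 0 and to 1 for t >= 1."""
    g = lambda s: math.exp(-1.0 / s) if s > 0 else 0.0
    return g(t) / (g(t) + g(1.0 - t))

def rho1(theta):
    """Bump supported in [1, 2*pi - 1], hence inside the chart U1 = (0, 2*pi)."""
    theta %= 2 * math.pi
    return smooth_step(theta - 1.0) * smooth_step(2 * math.pi - 1.0 - theta)

def rho2(theta):
    """Complementary bump; it vanishes near theta = pi, so it lives in U2 = (-pi, pi)."""
    return 1.0 - rho1(theta)

def integrate(g, a, b, n=4000):
    """Midpoint rule on [a, b]."""
    h = (b - a) / n
    return sum(g(a + (k + 0.5) * h) for k in range(n)) * h

def f(theta):
    """Coefficient of the sample 1-form omega = f(theta) d(theta) (an assumption)."""
    return 1.0 + math.cos(theta) ** 2

piece1 = integrate(lambda t: rho1(t) * f(t), 0.0, 2 * math.pi)    # integral over U1
piece2 = integrate(lambda t: rho2(t) * f(t), -math.pi, math.pi)   # integral over U2
total = piece1 + piece2
print(total, 3 * math.pi)
```

Neither piece has any meaning on its own, since it depends on the choice of bump function, but the sum should reproduce $\int_{S^1}\omega = 3\pi$.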

EXERCISE 6.6 Show that the definition (6.27) makes sense, i.e., that it is indepen- 
dent of the coordinate covering and the partition of unity. 

Theorem 6.5 (Stokes’ theorem - manifold version) Let $M$ be an oriented smooth
$n$-dimensional manifold with boundary, and let $\omega$ be an $(n-1)$-form with compact
support. Then

\[
\int_M d\omega = \int_{\partial M} \omega, \tag{6.28}
\]

where $\partial M$ is given the induced orientation.

6 Note that $\mathbb{H}^n$ is an open set in the subspace topology even though it is closed as a subset of $\mathbb{R}^n$. Therefore there
is no problem with demanding that the transition maps be smooth.

7 See Appendix C.

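Proof Let $\{U_i\}$ be a locally finite covering of $M$ by coordinate neighborhoods
with subordinate partition of unity $\{\rho_i\}$. Clearly $\omega = \sum_i \rho_i\,\omega$. Thus, by the linearity
of integration, it suffices to prove Stokes’ theorem for each $\rho_i\,\omega$. (Note that $\rho_i\,\omega$ has
compact support because $\operatorname{supp}\rho_i\,\omega \subseteq \operatorname{supp}\rho_i \cap \operatorname{supp}\omega$ is a closed subset of a
compact set.) As the support of $\rho_i\,\omega$ is contained in $U_i$, and as each $U_i$ is homeomorphic
to a subset of either $\mathbb{R}^n$ or $\mathbb{H}^n$, it suffices to prove the result assuming
either $M = \mathbb{R}^n$ or $M = \mathbb{H}^n$.

We have

\[
\omega = \sum_{i=1}^n a_i\,dx^1\wedge\cdots\wedge\widehat{dx^i}\wedge\cdots\wedge dx^n,
\]

where the hat indicates that the factor $dx^i$ is omitted, so that

\[
d\omega = \sum_{i=1}^n (-1)^{i-1}\,\frac{\partial a_i}{\partial x^i}\,dx^1\wedge\cdots\wedge dx^n.
\]

If $M = \mathbb{R}^n$ then

\[
\int_{\mathbb{R}^n} d\omega = \sum_{i=1}^n (-1)^{i-1} \int \left(\int_{-\infty}^{\infty} \frac{\partial a_i}{\partial x^i}\,dx^i\right) dx^1\cdots dx^{i-1}\,dx^{i+1}\cdots dx^n.
\]

But $\omega$ has compact support so $a_i(x)$ has compact support, and

\[
\int_{-\infty}^{\infty} \frac{\partial a_i}{\partial x^i}\,dx^i = a_i(\dots, \infty, \dots) - a_i(\dots, -\infty, \dots) = 0.
\]

As $\mathbb{R}^n$ has no boundary, Stokes’ theorem holds for this case.

If $M = \mathbb{H}^n$ then everything goes as before, except that the limits of integration
on the $x^n$ integral are now $0$ to $\infty$, so we get

\[
\int_{\mathbb{H}^n} d\omega = (-1)^{n-1} \int \left(\int_0^{\infty} \frac{\partial a_n}{\partial x^n}\,dx^n\right) dx^1\cdots dx^{n-1}
= (-1)^n \int_{\mathbb{R}^{n-1}} a_n(x^1, \dots, x^{n-1}, 0)\,dx^1\cdots dx^{n-1}.
\]

But $\omega$ restricted to $\partial\mathbb{H}^n$ is

\[
\omega = a_n(x^1, \dots, x^{n-1}, 0)\,dx^1\wedge\cdots\wedge dx^{n-1},
\]

so using the induced orientation on $\partial\mathbb{H}^n$ gives

\[
\int_{\partial\mathbb{H}^n} \omega = (-1)^n \int_{\mathbb{R}^{n-1}} a_n(x^1, \dots, x^{n-1}, 0)\,dx^1\cdots dx^{n-1} = \int_{\mathbb{H}^n} d\omega.
\]

Hence Stokes’ theorem holds in all cases. □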


EXERCISE 6.7 Let $f$ be a function, $\eta$ an $(n-1)$-form, and $M$ an $n$-dimensional
manifold. Derive the general integration by parts formula,

\[
\int_M f\,d\eta = \int_{\partial M} f\,\eta - \int_M df\wedge\eta.
\]



Remark Let $\iota : \partial M \to M$ be the inclusion map of the boundary of $M$ into $M$.
Technically, Stokes’ theorem should be written

\[
\int_M d\omega = \int_{\partial M} \iota^*\omega, \tag{6.29}
\]

because $\omega$ really lives on $M$, whereas to evaluate the integral on the right-hand
side we must evaluate $\omega$ on the boundary. Most of the time this formulation is
needlessly pedantic, but occasionally it is useful. (A similar remark applies to the
first version of Stokes’ theorem.)


6.5 de Rham’s theorem 


Having developed the theory of integration on manifolds, we are now in a position 
to fulfill our promise of relating smooth singular homology and de Rham cohomol- 
ogy. As you might have guessed, the key tool that ties them together is integration. 
The basic idea is that integration provides a nondegenerate pairing between de 
Rham cohomology and smooth singular homology, so that the two are dual to one 
another in a precise sense. There are two ways to state de Rham’s theorem, and 
each is worth knowing. We first discuss de Rham’s original formulation, as it fits 
nicely with our previous discussion of integration on chains, and then turn to the 
slightly more modern statement. 

Given a form $\omega$ and a chain $c$ we get a real number, namely $\int_c \omega$. In this sense,
the space of forms can be thought of as a kind of dual to the space of chains. But
this duality is not too useful because each space is infinite dimensional and, as we
have said, infinite-dimensional dualities are treacherous. But we know that both
the de Rham cohomology and the smooth singular homology of a space are finite
dimensional, so we might expect that restricting ourselves to elements of those
spaces might be more profitable, and this is indeed the case.

If $\omega$ is a closed form and $z$ is a cycle then $\int_z \omega$ is called the period of $\omega$ on $z$.
Two important facts about periods follow immediately from Stokes’ theorem: (i)
the period of an exact form on any cycle is zero, and (ii) the period of a closed form
on any boundary is zero.⁸ This raises the question of whether the converses are true.
Put another way, if all the periods of a closed form vanish, is the form exact? Given
a map from cycles to real numbers that vanishes on all boundaries, is it defined by
the periods of some closed form? The answer to both questions is yes, and this is
the content of de Rham’s two theorems.⁹

Theorem 6.6 (de Rham’s first theorem) A closed form is exact if and only if all 
its periods vanish. 

Theorem 6.7 (de Rham’s second theorem) Let per be a linear map that assigns
a real number to every cycle. If per vanishes on all boundaries then there exists a
closed form $\omega$ whose periods are given by per:

\[
\int_z \omega = \operatorname{per}(z).
\]
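
Periods are easy to compute numerically, and a single nonzero period already shows, by de Rham’s first theorem, that a closed form is not exact. The sketch below uses a standard example that is our own assumption rather than one taken from the text: the closed 1-form $(-y\,dx + x\,dy)/(x^2 + y^2)$ on the punctured plane, integrated over two circles.

```python
import math

def period(center, radius, n=2000):
    """Integral of (-y dx + x dy) / (x^2 + y^2) around a counterclockwise circle."""
    cx, cy = center
    dt = 2 * math.pi / n
    total = 0.0
    for k in range(n):
        t = (k + 0.5) * dt
        x, y = cx + radius * math.cos(t), cy + radius * math.sin(t)
        dx, dy = -radius * math.sin(t) * dt, radius * math.cos(t) * dt
        total += (-y * dx + x * dy) / (x * x + y * y)
    return total

around_puncture = period((0.0, 0.0), 1.0)   # cycle that is not a boundary
away_from_it = period((3.0, 0.0), 1.0)      # cycle bounding a disk in the punctured plane
print(around_puncture, away_from_it)
```

The first circle winds around the puncture and gives a period close to $2\pi$; the second bounds a disk on which the form is defined, so its period vanishes, in line with fact (ii).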

But what does this have to do with homology and cohomology? Well, everything,
but to see this we first introduce another kind of cohomology. Given a chain
complex $(C_\bullet, \partial)$,

\[
\cdots \longrightarrow C_{\ell+1} \longrightarrow C_\ell \longrightarrow C_{\ell-1} \longrightarrow \cdots
\]

we obtain a dual chain complex $(C^\bullet, \partial^*)$,

\[
\cdots \longleftarrow C^{\ell+1} \longleftarrow C^\ell \longleftarrow C^{\ell-1} \longleftarrow \cdots
\]

where $C^\ell = C_\ell^*$ and $\partial^*$ is the dual map, defined by

\[
(\partial^*_\ell f)(c) = f(\partial_{\ell+1} c), \tag{6.30}
\]

where $f \in C^\ell$ and $c \in C_{\ell+1}$. (The only thing one needs to check is that $\partial^*$ is
nilpotent, but that follows immediately from the nilpotence of $\partial$.) By definition,
the smooth singular cohomology of $M$ is the homology of $(C^\bullet(M), \partial^*)$ where
$C_\bullet(M)$ is the complex of smooth singular chains. Explicitly, we have

\[
\begin{aligned}
Z^\ell(M) &= \{f \in C^\ell(M) : \partial^* f = 0\} \\
&= \text{the space of (smooth singular) $\ell$-cocycles of $M$,}
\end{aligned}
\tag{6.31}
\]

\[
\begin{aligned}
B^\ell(M) &= \{\partial^* f : f \in C^{\ell-1}(M)\} \\
&= \text{the space of (smooth singular) $\ell$-coboundaries of $M$,}
\end{aligned}
\tag{6.32}
\]

\[
\begin{aligned}
H^\ell(M) &= Z^\ell(M)/B^\ell(M) \\
&= \text{the $\ell$th smooth singular cohomology group of $M$.}
\end{aligned}
\tag{6.33}
\]

(Note that, in general, $Z^\ell \neq Z_\ell^*$ and $B^\ell \neq B_\ell^*$. As for $H^\ell$ and $H_\ell^*$, see Theorem 6.9
below.)

8 Compare Exercise 6.4.

9 For proofs, see [21] or [42].

The theorems of de Rham say that the de Rham cohomology and the smooth
singular cohomology of a manifold are one and the same. To see this, observe that,
given any $\ell$-form $\omega$, the integral of $\omega$ can be thought of as a linear map

\[
\int\omega : C_\ell(M) \to \mathbb{R},
\]

given by

\[
\left(\int\omega\right)(c) := \int_c \omega. \tag{6.34}
\]

In other words, $\int\omega$ is an $\ell$-cochain.

If $\omega$ is closed then $\int\omega$ is actually an $\ell$-cocycle because, by (6.30), (6.34), and
Stokes’ theorem, for any $(\ell+1)$-chain $c$,

\[
\left(\partial^*\int\omega\right)(c) = \left(\int\omega\right)(\partial c) = \int_{\partial c}\omega = \int_c d\omega = 0. \tag{6.35}
\]

Hence, integration provides a homomorphism

\[
\int : Z^\ell_{\mathrm{dR}}(M) \to Z^\ell(M), \qquad \omega \mapsto \int\omega \tag{6.36}
\]

from the space of closed $\ell$-forms to the space of smooth singular $\ell$-cocycles on $M$.


EXERCISE 6.8 Show that if $\omega = d\tau$ is an exact $\ell$-form then $\int\omega$ is an $\ell$-coboundary.
That is, the integration map provides a natural homomorphism

\[
\int : B^\ell_{\mathrm{dR}}(M) \to B^\ell(M). \tag{6.37}
\]

It follows¹⁰ that the integration map provides a homomorphism

\[
\int : H^\ell_{\mathrm{dR}}(M) \to H^\ell(M), \qquad [\omega] \mapsto \left[\int\omega\right] \tag{6.38}
\]

of cohomologies. The two theorems of de Rham tell us that this map is actually an
isomorphism.

10 Compare Exercise 1.15.

EXERCISE 6.9 Prove that the map (6.38) is an isomorphism. Hint: As a map
between cohomologies, de Rham’s first theorem says that $\int$ is injective and de
Rham’s second theorem says that $\int$ is surjective.

We therefore obtain the following repackaging of de Rham’s two theorems. 


Theorem 6.8 (de Rham) Let $M$ be a smooth manifold. Then the de Rham
cohomology coincides with the smooth singular cohomology of $M$:

\[
H^\ell_{\mathrm{dR}}(M) \cong H^\ell(M).
\]


The obvious question is: what is the relation between smooth singular cohomology
and smooth singular homology? To our great relief, they are isomorphic. More
precisely, we have the following theorem, which relates the cohomology and the
homology (with coefficients taken from a field) of any complex.¹¹

Theorem 6.9 For any complex $C$ and for any coefficient field $\mathbb{F}$,

\[
H^\ell(C, \mathbb{F}) \cong H_\ell^*(C, \mathbb{F}).
\]

11 If there were any justice in the world, cohomology would always be isomorphic to the dual of homology.
Unfortunately, the world is not always just. In general, the issue depends on the coefficient groups one uses to
define these spaces. For homology and cohomology over general abelian groups (with a corresponding more
general definition of “dual”), the situation is much more subtle and leads to something called the universal
coefficient theorem. (See e.g. [37].)

Proof First, we show there is a natural linear surjection $h : H^\ell \to H_\ell^*$. Let
$[f] \in H^\ell$ and $[z] \in H_\ell$. Define $h$ by

\[
h([f])([z]) := f(z). \tag{6.39}
\]

This is well defined, for if $[f] = [g]$ and $[z] = [y]$ then $g = f + \partial^* b$ for some $b$
and $y = z + \partial x$ for some $x$, and

\[
h([g])([y]) = g(y) = (f + \partial^* b)(z + \partial x) = f(z),
\]

because $f(\partial x) = \partial^* f(x) = 0$, $\partial^* b(z) = b(\partial z) = 0$, and $\partial^* b(\partial x) = b(\partial^2 x) = 0$.
Let $g \in H_\ell^*$. This extends naturally to an element $\bar g \in Z_\ell^*$ by setting

\[
\bar g(z) :=
\begin{cases}
0 & \text{if } z \in B_\ell, \text{ and} \\
g([z]) & \text{otherwise.}
\end{cases}
\]

Let $\pi : C_\ell \to Z_\ell$ denote the natural projection map. Composing $\bar g$ with the projection
$\pi$ yields an element $\bar g\pi \in C_\ell^*$. By the definition of $\bar g$, $\partial^*(\bar g\pi) = (\bar g\pi)\partial = 0$ so
$\bar g\pi \in Z^\ell$. Therefore $[\bar g\pi] \in H^\ell$.

We must verify that $h$ takes $[\bar g\pi]$ to $g$. If $z$ is a boundary then

\[
h([\bar g\pi])([z]) = \bar g\pi(z) = \bar g(z) = 0 = g([0]) = g([z]),
\]

while if $z$ is a nontrivial cycle, we have

\[
h([\bar g\pi])([z]) = \bar g\pi(z) = \bar g(z) = g([z])
\]

yet again, so all is well.

We conclude the proof by showing that $H^\ell$ and $H_\ell$ have the same dimensions.
Consider the chain complex

\[
\cdots \xrightarrow{\ \partial_{\ell+2}\ } C_{\ell+1} \xrightarrow{\ \partial_{\ell+1}\ } C_\ell \xrightarrow{\ \partial_\ell\ } C_{\ell-1} \xrightarrow{\ \partial_{\ell-1}\ } \cdots
\]

and the corresponding cochain complex

\[
\cdots \longleftarrow C^{\ell+1} \xleftarrow{\ \partial^*_\ell\ } C^\ell \xleftarrow{\ \partial^*_{\ell-1}\ } C^{\ell-1} \longleftarrow \cdots
\]

By Exercise 1.53,

\[
\begin{aligned}
Z^\ell &= \ker\partial^*_\ell = \operatorname{Ann}\operatorname{im}\partial_{\ell+1} = \operatorname{Ann} B_\ell, \\
B^\ell &= \operatorname{im}\partial^*_{\ell-1} = \operatorname{Ann}\ker\partial_\ell = \operatorname{Ann} Z_\ell.
\end{aligned}
\]

However, for any subspace $W$ of a vector space $V$, $\dim V = \dim W + \dim\operatorname{Ann} W$
(Exercise 1.21), so

\[
\dim Z_\ell + \dim B^\ell = \dim C_\ell = \dim B_\ell + \dim Z^\ell,
\]

from which we obtain

\[
\dim Z_\ell - \dim B_\ell = \dim Z^\ell - \dim B^\ell
\]

or $\dim H_\ell = \dim H^\ell$. □
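
The dimension count at the end of the proof can be checked on a small example. The sketch below is our own illustration, not the book’s: it takes the simplicial chain complex of a hollow triangle (a combinatorial circle) over the rationals, writes the boundary map $\partial_1$ as a matrix, uses its transpose as the dual map, and compares $\dim H_1$ with $\dim H^1$.

```python
from fractions import Fraction

def rank(matrix):
    """Rank over the rationals by Gaussian elimination."""
    rows = [[Fraction(x) for x in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(r + 1, len(rows)):
            factor = rows[i][c] / rows[r][c]
            rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def transpose(matrix):
    return [list(col) for col in zip(*matrix)]

# Boundary map d1 : C_1 -> C_0 for the hollow triangle with vertices v0, v1, v2.
# Columns are the edges [v0,v1], [v1,v2], [v0,v2]; rows are the vertices.
d1 = [[-1,  0, -1],
      [ 1, -1,  0],
      [ 0,  1,  1]]

dim_C1 = 3
# Homology: Z_1 = ker d1, and B_1 = 0 because there are no 2-chains.
dim_H1 = (dim_C1 - rank(d1)) - 0
# Cohomology: Z^1 is all of C^1 (the dual map into C^2 = 0 vanishes),
# and B^1 is the image of the transpose of d1.
dim_H1_dual = dim_C1 - rank(transpose(d1))
print(dim_H1, dim_H1_dual)
```

Both dimensions come out equal to 1, as expected for a circle; the equality reflects nothing deeper than the fact that a matrix and its transpose have the same rank, which is the annihilator argument above in matrix language.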

Combining Theorems 6.8 and 6.9 enables us to conclude the following. 

Corollary 6.10

\[
H^\ell_{\mathrm{dR}}(M) \cong H_\ell(M).
\]

From Theorem 6.2 and Corollary 6.10 we get the following.

Corollary 6.11 (de Rham triangulated version) Let $K$ be a smooth triangulation
of $M$. Then

\[
H^\ell_{\mathrm{dR}}(M) \cong H_\ell(K).
\]
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We already knew that de Rham cohomology was a topological invariant, but 
this theorem shows that we get the same topological information from de Rham 
cohomology as we get from simplicial homology, which is reassuring. 12 

For a fun application of homology and cohomology to electrical circuit theory, 
see Appendix G. 


Additional exercises 

6.10 Integration over the sphere Evaluate $\int_{S^2}\omega$, where $\omega$ is the (global) 2-form
defined in Example 3.10. Hint: This exercise is actually a bit tricky, because
of questions about integration domains and orientations. One way to do the
integral would be to integrate $\omega_U$ over the patch $U$ that covers everything
except the north pole, because a single point is a set of measure zero and
therefore irrelevant to the integral. Alternatively, you could integrate $\omega_U$ over
$U_0$, the part of $U$ corresponding to the southern hemisphere, and $\omega_V$ over
$V_0$, the part of $V$ corresponding to the northern hemisphere, and add the
results together (ignoring the fact that you are integrating twice over the equator,
because it is also a set of measure zero). Either way, you must choose
an orientation of $S^2$. If you decide to integrate over both $U_0$ and $V_0$, you
must make sure that you choose the same orientation on each patch. Answer:
$\pm\pi$, depending on the orientation chosen. Compare this result with that of
Exercise 3.40.

6.11 Integration over the torus Let $dx\wedge dy$ be the global 2-form on the torus
defined in terms of the global 1-forms $dx$ and $dy$. (See Exercise 3.23.) Evaluate
$\int_{T^2} dx\wedge dy$. Remark: This is not the area of the familiar torus in $\mathbb{R}^3$
because, as we defined it, our torus does not actually live in $\mathbb{R}^3$.

6.12 Integration using Stokes’ theorem Let $V = \{(x, y, z) \in \mathbb{R}^3 : x^2 + y^2 \le
(az/h)^2,\ -1 < x, y < 1,\ 0 < z < h\}$ be a three-dimensional right circular
(upside-down) cone of height $h$ and base radius $a$. Let $\alpha = x\,dy\wedge dz +
y\,dz\wedge dx$.

(a) Evaluate $\int_{\partial V}\alpha$. Hint: Parameterize, and watch out for orientations.

(b) Use Stokes’ theorem to find the volume of the cone.

6.13 Stokes’ theorem illustrated Let $\omega = y\,dx + z\,dy + x\,dz$ and let $\Sigma$ be the
disk obtained as the intersection of the plane z = Via /2 with the sphere 
$x^2 + y^2 + z^2 = a^2$. Verify that Stokes’ theorem holds in this case. That is,
show that $\int_{\partial\Sigma}\omega = \int_\Sigma d\omega$.


12 For a direct proof of Corollary 6.11 see e.g. [64], Section 3.4, or [77], Section 6.2. For other proofs of de
Rham’s theorem see e.g. [51], Chapter 16, or [88], Chapter 5.




6.14 A cohomological exercise Let $M$ be a compact oriented surface without
boundary, and let $p \in M$. Let $C \subset M$ be a small circle surrounding $p$, and
let $\iota : C \to M - \{p\}$ be the inclusion map. Show that the induced map
$\iota^* : H^1_{\mathrm{dR}}(M - \{p\}) \to H^1_{\mathrm{dR}}(C)$ is zero. Hint: Stretch the hole caused by the
puncture at $p$ until it fills the inside of $C$ and use Stokes’ theorem. At one
point you will need to use the fact that $\int_C : H^1_{\mathrm{dR}}(C) \to \mathbb{R}$ is an isomorphism,
a fact that you can prove either directly or else by wielding the hammer of de
Rham’s theorem.

6.15 Boundary manifolds Prove that the boundary of a manifold with boundary 
is a manifold without boundary. 
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The Editor is convinced that the notion of a connection in a vector bundle 
will soon find its way into a class on advanced calculus, as it is a funda- 
mental notion and its applications are wide-spread. His chapter “Vector 
Bundles with a Connection” hopefully will show that it is basically an 
elementary concept. 

S. S. Chern†


7.1 The definitions 

Let $M$ be a differentiable manifold. Crudely put, a vector bundle is just a collection,
or bundle, of vector spaces, one for each point $p$ of $M$, that vary smoothly as $p$
varies. For example, if $M$ is a differentiable manifold and if $T_pM$ is the tangent
space to $M$ at a point $p$ then the union $TM$ of all the $T_pM$ as $p$ varies over $M$
is a vector bundle called the tangent bundle of $M$. Similarly, the cotangent bundle
$T^*M$ of $M$ is just the union of all the cotangent spaces $T_p^*M$ as $p$ varies over $M$.

Essentially, a vector bundle over M is a space E that looks locally like the Carte- 
sian product of M with a vector space. As we are primarily concerned with the local 
properties of vector bundles, we do not lose much by limiting ourselves to product 
bundles. But, for those who insist on knowing all the gory details, the official def- 
inition is provided here. The beginner should skim over the next two paragraphs 
and revisit them only as needed. 

A vector bundle $(E, M, Y, \pi)$ over a manifold $M$ is a manifold $E$ and a projection
map $\pi : E \to M$ satisfying the following three axioms. (The axioms may
make a bit more sense if you refer to Figure 7.1.)

(B1) All the fibers are isomorphic. Each fiber $\pi^{-1}(p)$, $p \in M$, is isomorphic to
a fixed vector space $Y$ of dimension $m$ ($m$ is called the rank of the vector
bundle).

(B2) The manifold $E$ is locally a product. For all $p \in M$ there is a neighborhood
$U$ containing $p$ and a diffeomorphism

\[
\varphi_U : \pi^{-1}(U) \to U \times Y \tag{7.1}
\]

(called a local trivialization of $E$ over $U$).

(B3) The local trivialization $\varphi_U$ carries fibers to fibers and the restriction is linear,
i.e.,

\[
\varphi_U : \pi^{-1}(p) \to \pi_1^{-1}(p) \tag{7.2}
\]

is linear for all $p$. The map $\pi_1 : U \times Y \to U$ is a projection onto the first
factor: $(p, y) \mapsto p$.

The manifold $E$ is called the total space or the bundle space, the manifold $M$ is
called the base space, and $\pi$ is called the bundle projection of the vector bundle
$(E, M, Y, \pi)$. For brevity, one usually writes $E \to M$ to denote the bundle
$(E, M, Y, \pi)$ if the projection map $\pi$ is obvious or understood.

† S. S. Chern, Vector bundles with a connection, in Global Differential Geometry, Studies in Mathematics
Vol. 27, ed. S. S. Chern (Mathematical Association of America, Washington, DC, 1989), p. ii. Copyright The
Mathematical Association of America 2013. All rights reserved. Reprinted by permission.

Suppose that $U$ and $V$ are two overlapping neighborhoods of $p$ on $M$. Then we
get a smooth map

\[
\varphi_V \circ \varphi_U^{-1} : (U \cap V) \times Y \to (U \cap V) \times Y. \tag{7.3}
\]

By virtue of (B2) we have

\[
(\varphi_V \circ \varphi_U^{-1})(p, y_U) = (p, y_V), \tag{7.4}
\]

where $y_V = g_{VU}(p)\,y_U$ for some nondegenerate endomorphism $g_{VU}(p) : Y \to Y$
that varies smoothly with $p$. The endomorphism $g_{VU}$ is called a bundle transition
function. If $\{U, V, W, \dots\}$ is a covering of $M$, it follows that the transition
functions satisfy the following cocycle conditions whenever the overlaps are
nontrivial:¹

(1) $g_{UU}(p) = \mathrm{id}$,

(2) $g_{UV}(p)\,g_{VU}(p) = \mathrm{id}$,

(3) $g_{UV}(p)\,g_{VW}(p)\,g_{WU}(p) = \mathrm{id}$.
EXERCISE 7.1 Prove that the transition functions satisfy properties (1), (2), and (3) 
above. 

A (smooth) section of $E$ is a (smooth) map

\[
s : M \to E \tag{7.5}
\]

such that

\[
\pi \circ s = \mathrm{id}. \tag{7.6}
\]

The definition implies that $s$ can be viewed as a vector-valued map on $M$. In particular,
the space $\Gamma(E)$ of all sections of $E$ inherits a natural vector space structure.
(Vector addition and scalar multiplication are defined pointwise.)

Example 7.1 The simplest vector bundle is the trivial bundle, which is globally a
product: $E = M \times Y$.

It is useful to have a criterion for triviality. This is provided by the next result. A
(smooth) basis of sections over a subset $U \subseteq M$ is a collection $\{e_1, \dots, e_n\}$ of $n$
sections that is linearly independent at each point of $U$. Such a smooth basis of
sections is just what we have called a frame field. As $E$ is locally a product, we can
always find a basis of sections for any sufficiently small neighborhood of $M$. But
what about larger neighborhoods?

Theorem 7.1 A vector bundle E is trivial if and only if it has a global smooth 
basis of sections. 

Proof Assume $E$ to be trivial. Then there is a global trivialization map $\varphi : E \to
M \times Y$ which is smooth and preserves the linear structure on fibers. Pick a basis $\{f_i\}$
for $Y$, and define $e_i(p) := \varphi^{-1}(p, f_i)$. This gives $n$ smooth sections on $E$, and they
form a basis for $\pi^{-1}(p)$ because $\varphi$ is a linear isomorphism of fibers. Conversely,
suppose that $\{e_i\}$ is a basis of sections over $M$. Then any element of $\pi^{-1}(p)$ can be
written $v_p = \sum_i e_i(p)\,v^i$. Pick a basis $\{f_i\}$ for $Y$ and define $v = \sum_i f_i\,v^i$. Then we
get a map $\psi : E \to M \times Y$ given by $\psi(v_p) = (p, v)$. This is smooth and linear on
fibers by construction. □

1 There is a reverse theorem that says roughly that any product structure glued together with transition functions
satisfying the cocycle conditions defines a vector bundle.

EXERCISE 7.2 Show that one triviality implies another: if $E$ is trivial, we can
choose $g_{UV} = \mathrm{id}$ for all $U$ and $V$.

Remark It can be shown (e.g. [38], Corollary 1.8) that a vector bundle over a 
contractible manifold is always trivial. 

Remark If $TM$ is trivial then $M$ is called parallelizable. Most manifolds are not
parallelizable. For example it is known that, of all the $n$-spheres, only $S^0$, $S^1$, $S^3$
and $S^7$ are parallelizable, a fact that is intimately related to the fact that there are
only four normed division algebras: the real numbers, the complex numbers, the
quaternions, and the octonions. (See e.g. [20].)

EXERCISE 7.3 (For readers who have completed the problems on Lie groups in
Chapter 4.) Show that every Lie group is parallelizable, i.e., $TG \cong G \times T_eG$.

Of course, not all bundles are trivial. 

Example 7.2 The Möbius band $B$ that we encountered in Example 3.7 can be
viewed as a nontrivial smooth vector bundle over the circle with fiber $\mathbb{R}$. The coordinate
charts used to define the manifold structure can be modified slightly to provide
a local trivialization. Recall that $B$ was defined to be the quotient $\mathbb{R}^2/\!\sim$ where
$(x, y) \sim (x + 1, -y)$. The circle $S^1$ is just $\mathbb{R}/\!\sim$ where $x \sim x + 1$. Let $[x, y]$ denote
the equivalence class of $(x, y)$ in $B$ as before, and let $[x]$ denote the equivalence class
of $x$ on the circle. Then the projection map $\pi : B \to S^1$ is given by $[x, y] \mapsto [x]$.
Consider the open cover of $S^1$ given by

\[
\begin{aligned}
U_1 &= \{[x] : 0 < x < 2/3\}, \\
U_2 &= \{[x] : 1/3 < x < 1\}, \\
U_3 &= \{[x] : 2/3 < x < 4/3\}.
\end{aligned}
\]

Define local trivializations $\varphi_i : \pi^{-1}(U_i) \to U_i \times \mathbb{R}$ by $\varphi_i([x, y]) = ([x], y)$, where
$(x, y)$ is the unique representative of $[x, y]$ in the parameter range corresponding to
$U_i$. For example, $\varphi_1([1/3, y]) = ([1/3], y)$ but $\varphi_1([4/3, y]) = \varphi_1([1/3, -y]) =
([1/3], -y)$.

Now consider the transition maps. Following in our old footsteps, we find that
$(\varphi_1 \circ \varphi_2^{-1})([x], y) = ([x], y) = (\varphi_2 \circ \varphi_3^{-1})([x], y)$ but that $(\varphi_3 \circ \varphi_1^{-1})([x], y) =
([x], -y)$. Thus the bundle transition functions are $g_{12} = g_{23} = 1$ and $g_{31} = -1$.
(Observe that they satisfy the cocycle conditions.) This confirms the obvious fact that
$B$ is nontrivial.

Example 7.3 The archetypical example of a vector bundle is the tangent bundle
$E = TM$ of a manifold $M$, which consists of the union of all the tangent spaces
$T_pM$ together with the natural projection $\pi : TM \to M$ that sends $T_pM \mapsto p$.
A point of $TM$ may be thought of as a pair $(p, Y_p)$ where $p \in M$ is a point and
$Y_p \in T_pM$ is a tangent vector. Since $\dim T_pM = \dim M = n$, the rank of $TM$ is $n$.
Sections of $TM$ are precisely vector fields on $M$. In general, $TM$ is nontrivial.

If $\dim M = n$, $TM$ is a $2n$-dimensional manifold whose bundle transition functions
are precisely the Jacobian matrices relating two local charts of $M$. To see
this, first observe that we may cover $TM$ by the countable collection of open sets
$\{\pi^{-1}(U_i)\}$ where $\{U_i\}$ is a countable cover of $M$. Next, suppose that $q$ and $r$ are distinct
points in $TM$. If on the one hand $\pi(q) \neq \pi(r)$ then choose a neighborhood
$U$ of $\pi(q)$ and a neighborhood $V$ of $\pi(r)$ that are disjoint. (This is possible because
$M$ is Hausdorff.) Then $\pi^{-1}(U)$ and $\pi^{-1}(V)$ are also disjoint. On the other hand,
suppose that $\pi(q) = \pi(r) = p$. Then $q$ and $r$ must be two distinct elements of $T_pM$,
so they have disjoint neighborhoods $U_q$ and $U_r$ in $T_pM$ (because vector spaces are
Hausdorff). If $U$ is a neighborhood of $p$ in $M$ then $\pi^{-1}(U) \cap U_q$ and $\pi^{-1}(U) \cap U_r$
are disjoint open sets in $TM$. It follows that $TM$ is Hausdorff.

We coordinatize $TM$ as follows. Pick a point $q \in TM$ and set $p := \pi(q)$.
Let $(U, \psi_U)$ and $(V, \psi_V)$ be overlapping charts of $M$ containing $p$, with corresponding
local coordinates $u^1, \dots, u^n$ and $v^1, \dots, v^n$, respectively. Let $Y$ be a
vector field on $U$ and set $Y_U^i = Y(u^i)$. Define $\chi_U : \pi^{-1}(U) \to \mathbb{R}^n \times \mathbb{R}^n$ by
$q \mapsto (u^1(p), \dots, u^n(p), Y_U^1(p), \dots, Y_U^n(p))$. Define $\chi_V$ similarly. Writing

\[
Y_V^i = Y(v^i) = \sum_j Y_U^j\,\frac{\partial v^i}{\partial u^j} = \sum_j (g_{VU})^i{}_j\,Y_U^j, \tag{7.7}
\]

we see that the manifold transition functions are

\[
\chi_V \circ \chi_U^{-1} = (\psi_V \circ \psi_U^{-1},\ g_{VU}).
\]

As these are diffeomorphisms, $TM$ is indeed a $2n$-dimensional manifold, as claimed.
Moreover, the bundle transition functions $g_{UV}$ are the Jacobian matrices of the
coordinate charts of $M$, as promised.
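
Since the transition functions of $TM$ are Jacobian matrices, the cocycle conditions reduce to the chain rule, and this can be checked numerically. The sketch below is illustrative only: the three charts on a patch of the plane (Cartesian, polar, and a linear change of coordinates) are our own assumed choices, and the Jacobians are estimated by central differences.

```python
import math

# Each chart is a pair (psi, psi_inverse) acting on a point given in Cartesian form.
charts = {
    "U": (lambda x, y: (x, y),
          lambda a, b: (a, b)),
    "V": (lambda x, y: (math.hypot(x, y), math.atan2(y, x)),
          lambda r, t: (r * math.cos(t), r * math.sin(t))),
    "W": (lambda x, y: (x + y, x - y),
          lambda a, b: ((a + b) / 2, (a - b) / 2)),
}

def g(to, frm, p, h=1e-6):
    """Jacobian matrix of psi_to o psi_frm^{-1} at the point p of the plane."""
    psi_to, _ = charts[to]
    psi_frm, inv_frm = charts[frm]
    c = psi_frm(*p)
    J = [[0.0, 0.0], [0.0, 0.0]]
    for j in range(2):
        plus, minus = list(c), list(c)
        plus[j] += h
        minus[j] -= h
        fp = psi_to(*inv_frm(*plus))
        fm = psi_to(*inv_frm(*minus))
        for i in range(2):
            J[i][j] = (fp[i] - fm[i]) / (2 * h)
    return J

def matmul(P, Q):
    return [[sum(P[i][k] * Q[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

p = (1.0, 2.0)
# Cocycle condition (3): g_UV g_VW g_WU should be the identity matrix.
product = matmul(matmul(g("U", "V", p), g("V", "W", p)), g("W", "U", p))
print(product)
```

Up to finite-difference error the printed matrix is the identity, and the same code with only two charts checks condition (2).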

Example 7.4 Another important example is $E = T^*M$, the cotangent bundle on
$M$, whose fibers are just the cotangent spaces to $M$. The transition functions are just
the inverses of the Jacobian matrices introduced in Example 7.3. The rank of $T^*M$
is the same as that of $TM$. Sections of $T^*M$ are 1-form fields on $M$.

Example 7.5 Let $E \to M$ and $F \to M$ be two vector bundles over $M$. For
simplicity, write $E_p$ to denote the fiber over $p$. Then the tensor product bundle
$(E \otimes F) \to M$ is the vector bundle over $M$ whose fibers are just the tensor products
of the individual fibers $(E \otimes F)_p = E_p \otimes F_p$. If the transition functions of $E \to M$
are $g_{UV}$ and the transition functions of $F \to M$ are $h_{UV}$, the transition functions of
$(E \otimes F) \to M$ are just $g_{UV} \otimes h_{UV}$. By iterating this construction we can obtain the
tensor product bundle

\[
\underbrace{TM \otimes \cdots \otimes TM}_{r\ \text{times}} \otimes \underbrace{T^*M \otimes \cdots \otimes T^*M}_{s\ \text{times}} \to M,
\]

sections of which are just tensor fields of type $(r, s)$ on $M$.


7.2 Connections 

Now we want to introduce an important device that will allow us to differentiate
sections on vector bundles in a useful way. Consider for a moment the tensor product
bundle $TM^{\otimes r} \otimes T^*M^{\otimes s}$ introduced in Example 7.5. A section of this bundle
is just a tensor field, and ideally we want some kind of derivative operator that
will act on a tensor field and give back another tensor field. As we have seen, one
way to do this is to use the Lie derivative $\mathcal{L}_X$. However, although the Lie derivative
is useful for some purposes, it suffers from a serious flaw: it is in some sense
nonlocal. It depends, not just on $X$ at a point, but on $X$ in the neighborhood of
the point.

It turns out to be much more useful to have an analogue of the ordinary direc- 
tional derivative that depends only on the direction of X at a point. But, as we saw 
in Exercise 3.22, the ordinary directional derivative itself will not do because typi- 
cally the partial derivative of a tensor is not a tensor. As we have already exhausted 
all the reasonable definitions of differentiation that use only the manifold structure 
itself, we are compelled to introduce some additional structure to yield the right 
sort of derivative operator. On the tensor product bundle we have been discussing 
this is accomplished by means of something called a “covariant derivative”, which 
is a connection on a vector bundle. 

Although one does not need the notion of a tensor product bundle to discuss
covariant derivatives, these are not the only bundles and connections that are
important in mathematics and physics. For example, the standard model of particle
physics is based upon the notion of gauge fields, which can be viewed as a connection
on a special kind of vector bundle called an associated bundle.² It therefore
behoves us to broaden our perspective slightly and discuss the general notion of a
vector bundle connection from the start.

2 Unfortunately, a discussion of gauge fields from the point of view of modern differential geometry would
require a chapter in itself. Instead, we refer the reader to [7] or [67]. See also Exercise 7.10.




A (linear) connection on a vector bundle $E$ is essentially the simplest kind of
operator that allows us to differentiate sections.³ More precisely, a connection is a
map⁴

$$D : \Gamma(E) \to \Gamma(E \otimes T^*M) \tag{7.8}$$

satisfying the following two properties. For $s, s_1, s_2 \in \Gamma(E)$ and $f$ a smooth
function on $M$, $D$ obeys

(D1) (linearity) $D(s_1 + s_2) = Ds_1 + Ds_2$, and
(D2) (Leibniz rule) $D(sf) = Ds \cdot f + s \otimes df$.

Surprisingly, this simple definition has powerful consequences although it takes a 
while to see them. 

When $E$ is the tensor product bundle $TM^{\otimes r} \otimes T^*M^{\otimes s}$ we want the connection
to satisfy certain compatibility properties beyond the basic ones. This gives rise to
what is usually called a covariant derivative, for which one uses slightly different
notation. As a section of $E$ is a tensor field, we write $T$ instead of $s$. Then $DT$ is a
tensor valued 1-form and, for any vector field $X$, we write

$$\nabla_X T := i_X DT, \tag{7.9}$$

where $i_X$ is the usual interior product. The result is another tensor field, the covariant
derivative of $T$ in the direction $X$. The axioms for a connection imply that for a
function $f$, vector fields $X$ and $Y$, and tensor fields $S$ and $T$ the covariant derivative
operator satisfies

(C1) (linearity) $\nabla_X (S + T) = \nabla_X S + \nabla_X T$, and
(C2) (Leibniz rule) $\nabla_X (fT) = f \nabla_X T + (Xf) T$.

In addition, by virtue of the definition of the covariant derivative and the function
linearity of the interior product, we also have, for functions $f$ and $g$,

(C3) (function linearity on subscript) $\nabla_{fX + gY} T = f \nabla_X T + g \nabla_Y T$.

It is the property (C3) that distinguishes the covariant derivative from the Lie 
derivative and which allows us to build tensorial objects from the covariant 
derivative. 

We will make further demands of a covariant derivative on a tensor product
bundle, however. In particular, by analogy with the Lie derivative, we insist that
it be “natural” with respect to dual pairings and tensor products. Thus, for any
function $f$ and 1-form $\omega$, we add the axioms

(C4) (reduction to ordinary derivative on functions) $\nabla_X f = Xf$,
(C5) (compatibility with dual pairing) $\nabla_X \langle T, \omega \rangle = \langle \nabla_X T, \omega \rangle + \langle T, \nabla_X \omega \rangle$, and
(C6) (Leibniz rule II) $\nabla_X (S \otimes T) = \nabla_X S \otimes T + S \otimes \nabla_X T$.

3 The word “connection” is meant to suggest a device that “connects” the fibers above different points of a
manifold. One can define more general bundles and more general connections on them, but we will not do so
here; see e.g. [10] or [47].

4 Elements of $\Gamma(E \otimes T^*M)$ can be thought of as “section-valued 1-forms” (or “1-form valued sections”).
Viewing a 1-form as a linear functional on vector fields via the natural dual pairing, one can think of an
element of $\Gamma(E \otimes T^*M)$ as an object that eats vector fields and spits out sections.

In Chapter 8 we will play around further with covariant derivatives. For now we
content ourselves with the simple but important observation that properties (C4)
to (C6) ensure that the covariant derivative on $TM^{\otimes r} \otimes T^*M^{\otimes s}$ is completely
determined by the choice of covariant derivative on $TM$. (See Exercise 8.31.)


7.3 Cartan’s moving frames and connection forms

Let us temporarily restrict our attention to a single neighborhood $U$ of $M$ where $E$
is locally trivial. Because the bundle over $U$ is trivial, Theorem 7.1 assures us that
we can find a smooth basis of sections $\{e_1, \dots, e_m\}$ for the fiber $\pi^{-1}(p)$ at each point.

The connection maps each $e_i$ to a section-valued 1-form. Because the $e_i$ form
a basis for each fiber, $De_i$ must be a linear combination of the $e_j$ with 1-form
coefficients, written⁵

$$De_i = \sum_j e_j \otimes \omega^j{}_i. \tag{7.10}$$

The elements $\omega^i{}_j$ are 1-forms, and the entire collection of $m^2$ 1-forms constitutes
a matrix $\omega = (\omega^i{}_j)$, called the connection matrix relative to the frame field $\{e_i\}$.
For ease of writing, we introduce the following matrix notation (cf. Section 1.8):

$$e = (e_1 \ e_2 \ \cdots \ e_m) \quad \text{and} \quad \omega = (\omega^i{}_j). \tag{7.11}$$

(In other words, $e$ is a $1 \times m$ matrix of sections while $\omega$ is a matrix of 1-forms.)
Following Cartan, we then write equations (7.10) as⁶

$$De = e\omega. \tag{7.12}$$

5 Note that some authors write $De_i = \sum_j \omega_i{}^j \otimes e_j$ instead. This has the unfortunate consequence of flipping
signs in some of the formulae that follow.

6 In this equation the tensor product symbol is suppressed. It should really be written $De = e \otimes \omega$ but people
are lazy, so you may as well get used to it now. You just have to remember that the tensor product symbol is
still there although invisible, otherwise you may be tempted to think that, when $E = TM$, $e$ acts on $\omega$ as a
derivation, which would be nonsense. (Recall the discussion at the end of Section 3.7.)

The connection matrix determines the connection completely. To see this, let

$$s = \sum_i e_i \sigma^i \tag{7.13}$$

be an arbitrary section. (Each $\sigma^i$ is a smooth function on $M$.) Then, by properties
(D1) and (D2),

$$\begin{aligned}
Ds &= \sum_j D(e_j \sigma^j) \\
   &= \sum_j \bigl( De_j \cdot \sigma^j + e_j \otimes d\sigma^j \bigr) \\
   &= \sum_i e_i \otimes \Bigl( \sum_j \omega^i{}_j \sigma^j + d\sigma^i \Bigr).
\end{aligned} \tag{7.14}$$


In other words, if we know the connection matrix then we know how $D$ acts on
arbitrary sections and vice versa. In shorthand notation (7.13) can be written

$$s = e\sigma, \tag{7.15}$$

where

$$\sigma = \begin{pmatrix} \sigma^1 \\ \vdots \\ \sigma^m \end{pmatrix} \tag{7.16}$$

is an $m \times 1$ matrix of smooth functions on $M$. Similarly, (7.14) becomes

$$Ds = e\omega\sigma + e\,d\sigma. \tag{7.17}$$


7.4 Curvature forms and the Bianchi identity 

Define a matrix of 2-forms by the shorthand formula

$$\Omega = d\omega + \omega \wedge \omega, \tag{7.18}$$

which means

$$\Omega^i{}_j = d\omega^i{}_j + \sum_k \omega^i{}_k \wedge \omega^k{}_j. \tag{7.19}$$

(Observe that the wedge product of the matrix of 1-forms with itself does not vanish!)
The matrix elements of $\Omega$ are called curvature 2-forms. The significance of
the term “curvature” will not become apparent until Chapter 8.

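Differentiating (7.18) using the ordinary derivative operator $d$ and using (7.18)
again gives

$$d\Omega = d^2\omega + d\omega \wedge \omega - \omega \wedge d\omega = (\Omega - \omega \wedge \omega) \wedge \omega - \omega \wedge (\Omega - \omega \wedge \omega).$$

Simplifying, we get the Bianchi identity

$$d\Omega = \Omega \wedge \omega - \omega \wedge \Omega. \tag{7.20}$$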


Equations (7.12) and (7.18) are known as Cartan’s equations of structure, and
(7.20) is sometimes called an integrability condition.⁷


7.5 Change of basis 

Suppose that we had chosen a different frame field on $U$, related to the old one by
a nondegenerate matrix $A = (a^i{}_j)$ of smooth functions. Then

$$e'_i = \sum_j e_j a^j{}_i, \tag{7.21}$$

or, in our matrix notation,

$$e' = eA. \tag{7.22}$$

In the new basis $e'$ the connection $D$ is represented by a new matrix $\omega'$, where

$$De' = e'\omega', \tag{7.23}$$

meaning, of course,

$$De'_i = \sum_j e'_j \otimes \omega'^j{}_i. \tag{7.24}$$

By virtue of (7.12), (7.22), (7.23), and property (D2), we have

$$eA\omega' = e'\omega' = De' = D(eA) = (De)A + e\,dA = e\omega A + e\,dA. \tag{7.25}$$

As the basis $e$ is arbitrary, we can strip it from both sides to conclude that

$$A\omega' = dA + \omega A, \tag{7.26}$$

or, left multiplying by $A^{-1}$,

$$\omega' = A^{-1} dA + A^{-1} \omega A. \tag{7.27}$$

This important formula illustrates how the connection matrix changes under a
change of frame field. By patching together connection matrices via this formula
and a partition of unity it can be shown that connections exist.⁸

7 The phrase “integrability condition” is applied rather loosely here. Strictly speaking an integrability condition
is the statement that a specific Pfaffian system is a differential ideal (see the discussion of Frobenius’ theorem
in Appendix F.2). But the phrase is also used informally to refer to a situation in which differentiating a partial
differential equation gives rise to a nontrivial necessary condition for the solution of the original equation. (This
use of the word “integrable” arises by analogy with the Poincaré lemma, which guarantees that, in any local
neighborhood, $d\lambda = 0$ implies that $\lambda = d\eta$ for some $\eta$; in other words, $d\lambda = 0$ is the integrability condition
ensuring that $\lambda$ can be “integrated”.) In our case, we begin with the structure equation $d\omega + \omega \wedge \omega = \Omega$, which
is a set of first-order differential equations for the connection 1-forms $\omega^i{}_j$ in terms of the curvature 2-forms $\Omega^i{}_j$.
Differentiating this equation yields the Bianchi identity, which provides necessary (but, as can be shown, not
sufficient) zeroth-order conditions that the connection 1-forms $\omega^i{}_j$ must satisfy in order to constitute a solution
of the structure equation. (My thanks go to Robert Bryant for a discussion on this point.) It is interesting to
observe that there are no additional integrability equations here, because applying the differential operator $d$ to
the Bianchi identity gives $0 = 0$ identically. (The Bianchi identities play an important role in gauge theory and
in Einstein’s theory of general relativity, where they can be interpreted as generalized conservation laws.)

Under the change of basis, the curvature 2-form matrix changes in a particularly
nice way. Differentiate the left-hand side of (7.26) and use (7.26) again to get

$$\begin{aligned}
dA \wedge \omega' + A\,d\omega' &= (A\omega' - \omega A) \wedge \omega' + A\,d\omega' \\
  &= A(d\omega' + \omega' \wedge \omega') - \omega A \wedge \omega' \\
  &= A\Omega' - \omega \wedge (dA + \omega A),
\end{aligned} \tag{7.28}$$

where

$$\Omega' = d\omega' + \omega' \wedge \omega'. \tag{7.29}$$

Differentiate the right-hand side of (7.26) to get

$$d\omega\,A - \omega \wedge dA. \tag{7.30}$$

Now equate (7.28) and (7.30) to obtain the transformation law

$$A\Omega' = \Omega A, \tag{7.31}$$

or

$$\Omega' = A^{-1} \Omega A. \tag{7.32}$$
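
The transformation laws (7.27) and (7.32) can be checked numerically. The sketch below (Python, standard library only) assumes a hypothetical connection $\omega = P\,dx + Q\,dy$ on a trivial rank-2 bundle over $\mathbb{R}^2$ and a hypothetical frame change $A$ with $\det A = 1$; none of these particular matrices come from the text. It builds $\omega'$ from (7.27), computes both curvatures by central differences, and compares $\Omega'$ with $A^{-1}\Omega A$ at one point.

```python
H = 1e-4  # finite-difference step; an arbitrary choice for this sketch

def matmul(A, B):
    return [[sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def add(A, B):
    return [[A[i][j] + B[i][j] for j in range(2)] for i in range(2)]

def sub(A, B):
    return [[A[i][j] - B[i][j] for j in range(2)] for i in range(2)]

def inv2(A):
    det = A[0][0] * A[1][1] - A[0][1] * A[1][0]
    return [[A[1][1] / det, -A[0][1] / det], [-A[1][0] / det, A[0][0] / det]]

def partial(F, axis, x, y):
    """Central-difference partial derivative of the matrix-valued function F."""
    if axis == 0:
        hi, lo = F(x + H, y), F(x - H, y)
    else:
        hi, lo = F(x, y + H), F(x, y - H)
    return [[(hi[i][j] - lo[i][j]) / (2 * H) for j in range(2)] for i in range(2)]

# Hypothetical data: omega = P dx + Q dy in the frame e, and a frame change A.
def P(x, y):
    return [[0.0, y], [x, 0.0]]

def Q(x, y):
    return [[x * y, 0.0], [1.0, -x * y]]

def A(x, y):
    return [[1.0, x * y], [x, 1.0 + x * x * y]]  # det A = 1 everywhere

def transformed(omega_k, axis):
    """dx (axis 0) or dy (axis 1) coefficient of omega' = A^{-1} dA + A^{-1} omega A."""
    def component(x, y):
        a_inv = inv2(A(x, y))
        return add(matmul(a_inv, partial(A, axis, x, y)),
                   matmul(a_inv, matmul(omega_k(x, y), A(x, y))))
    return component

P_new, Q_new = transformed(P, 0), transformed(Q, 1)

def curvature_xy(Pf, Qf, x, y):
    """dx^dy coefficient of d(omega) + omega ^ omega for omega = Pf dx + Qf dy."""
    d_omega = sub(partial(Qf, 0, x, y), partial(Pf, 1, x, y))
    commutator = sub(matmul(Pf(x, y), Qf(x, y)), matmul(Qf(x, y), Pf(x, y)))
    return add(d_omega, commutator)

x0, y0 = 0.3, -0.7
lhs = curvature_xy(P_new, Q_new, x0, y0)                          # Omega'
a0 = A(x0, y0)
rhs = matmul(inv2(a0), matmul(curvature_xy(P, Q, x0, y0), a0))    # A^{-1} Omega A
max_error = max(abs(lhs[i][j] - rhs[i][j]) for i in range(2) for j in range(2))
print(max_error)  # only finite-difference error should remain
```

Note the contrast that the check makes visible: $\omega'$ picks up the inhomogeneous term $A^{-1} dA$, whereas $\Omega'$ is obtained from $\Omega$ by conjugation alone.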


7.6 The curvature matrix and the curvature operator 

Let $X$ and $Y$ be two vector fields on $M$. From the curvature 2-forms we obtain the
curvature matrix by taking inner products:

$$\Omega(X, Y) = i_Y i_X \Omega. \tag{7.33}$$

Let $s = e\sigma$ and $X, Y \in \Gamma(TM)$. Define the curvature operator
$R(X, Y) : \Gamma(TM) \to \Gamma(TM)$ by

$$R(X, Y)s = e\,\Omega(X, Y)\,\sigma. \tag{7.34}$$


8 For a proof see [19], p. 106. For the existence of a specific connection, see Theorem 8.4.


Note that it is linear by construction. The question, though, is whether it is well
defined. The problem is that it appears to depend on the basis $e$. Suppose instead
we had $s = e'\sigma'$. Would we still get the same image? First we note by (7.22) that
$s = e\sigma = e'\sigma' = eA\sigma'$ implies that $\sigma = A\sigma'$ or $\sigma' = A^{-1}\sigma$. Thus, using (7.32),

$$e'\Omega'\sigma' = (eA)(A^{-1}\Omega A)(A^{-1}\sigma) = e\Omega\sigma, \tag{7.35}$$

so we do get the same image after all.

Although (7.34) is elegant, in order to do certain computations it is useful to 
have an expression for it in terms of covariant derivatives. 

Theorem 7.2 As an operator on vector fields,

$$R(X, Y) = \nabla_X \nabla_Y - \nabla_Y \nabla_X - \nabla_{[X, Y]}. \tag{7.36}$$


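Proof Let $s = e\sigma$ be a section. Then from (7.9) and (7.17) we have

$$\nabla_X s = i_X Ds = i_X D(e\sigma) = i_X (e\omega\sigma + e\,d\sigma) = e\,\omega(X)\sigma + e\,X\sigma, \tag{7.37}$$

where $\omega(X)$ and $X\sigma$ are defined entrywise. (That is, $\omega(X)^i{}_j = \omega^i{}_j(X)$ and
$(X\sigma)^j = X\sigma^j$, the latter meaning that $X$ acts on the function $\sigma^j$ as a derivation.)
Similarly, we have

$$\nabla_Y s = e\,\omega(Y)\sigma + e\,Y\sigma = e[\omega(Y)\sigma + Y\sigma]. \tag{7.38}$$

It follows that

$$\begin{aligned}
\nabla_X \nabla_Y s &= e\,\omega(X)[\omega(Y)\sigma + Y\sigma] + e\,X[\omega(Y)\sigma + Y\sigma] \\
  &= e\,\omega(X)\omega(Y)\sigma + e\,\omega(X)Y\sigma + e[X\omega(Y)]\sigma + e\,\omega(Y)X\sigma + e\,XY\sigma \\
  &= e\{\omega(X)\omega(Y) + \omega(X)Y + [X\omega(Y)] + \omega(Y)X + XY\}\sigma.
\end{aligned} \tag{7.39}$$

Hence

$$\begin{aligned}
(\nabla_X \nabla_Y - \nabla_Y \nabla_X - \nabla_{[X, Y]})s
  &= e\{\omega(X)\omega(Y) + \omega(X)Y + [X\omega(Y)] + \omega(Y)X + XY \\
  &\qquad - \omega(Y)\omega(X) - \omega(Y)X - [Y\omega(X)] - \omega(X)Y - YX \\
  &\qquad - \omega([X, Y]) - [X, Y]\}\sigma \\
  &= e\{\omega(X)\omega(Y) - \omega(Y)\omega(X) + [X\omega(Y)] - [Y\omega(X)] - \omega([X, Y])\}\sigma.
\end{aligned} \tag{7.40}$$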
But, appealing to (3.123),

$$\begin{aligned}
\Omega(X, Y) &= (d\omega + \omega \wedge \omega)(X, Y) \\
  &= X\omega(Y) - Y\omega(X) - \omega([X, Y]) + \omega(X)\omega(Y) - \omega(Y)\omega(X).
\end{aligned} \tag{7.41}$$

Comparison of (7.40) and (7.41), together with (7.34), yields the theorem. □

Informally, one hears the statement that curvature is a measure of the degree to
which covariant derivatives (as opposed to simple partial derivatives) fail to commute,
but the last term in (7.36) shows that this statement is not precisely true. Of
course, it is true if one considers covariant derivatives along two distinct coordinate
directions, because in that case $X = \partial_i$ and $Y = \partial_j$, so the last term in (7.36)
disappears.
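
The coordinate-direction case of Theorem 7.2 lends itself to a quick numerical experiment. The following sketch (Python, standard library only) assumes a hypothetical connection $\omega = P\,dx + Q\,dy$ on a trivial rank-2 bundle over $\mathbb{R}^2$ and a hypothetical section with components `sigma`; these are illustrative choices only. It implements (7.37) with central differences, forms $\nabla_x \nabla_y \sigma - \nabla_y \nabla_x \sigma$, and compares the result with $\Omega(\partial_x, \partial_y)\sigma$ computed from (7.41).

```python
H = 1e-3  # finite-difference step; an arbitrary choice for this sketch

def P(x, y):
    return [[0.0, y], [x, 0.0]]

def Q(x, y):
    return [[x * y, 0.0], [1.0, -x * y]]

def sigma(x, y):
    # Hypothetical section components (sigma^1, sigma^2) in the frame e.
    return [x * x + y, x * y]

def matvec(M, v):
    return [M[i][0] * v[0] + M[i][1] * v[1] for i in range(2)]

def shift(axis, x, y, step):
    return (x + step, y) if axis == 0 else (x, y + step)

def nabla(axis, s):
    """Covariant derivative of s along d/dx (axis 0) or d/dy (axis 1).

    Follows (7.37): the components are omega(X) sigma + X sigma, with
    X sigma approximated by a central difference.
    """
    omega_k = P if axis == 0 else Q

    def result(x, y):
        hi, lo = s(*shift(axis, x, y, H)), s(*shift(axis, x, y, -H))
        ds = [(hi[i] - lo[i]) / (2 * H) for i in range(2)]
        w = matvec(omega_k(x, y), s(x, y))
        return [ds[i] + w[i] for i in range(2)]

    return result

def commutator(x, y):
    """(nabla_x nabla_y - nabla_y nabla_x) sigma; the bracket [d_x, d_y] vanishes."""
    a = nabla(0, nabla(1, sigma))(x, y)
    b = nabla(1, nabla(0, sigma))(x, y)
    return [a[i] - b[i] for i in range(2)]

def curvature_on_sigma(x, y):
    """Omega(d_x, d_y) sigma, with Omega(d_x, d_y) = d_x Q - d_y P + PQ - QP."""
    qh, ql = Q(x + H, y), Q(x - H, y)
    ph, pl = P(x, y + H), P(x, y - H)
    p, q = P(x, y), Q(x, y)
    omega = [[(qh[i][j] - ql[i][j]) / (2 * H) - (ph[i][j] - pl[i][j]) / (2 * H)
              + sum(p[i][k] * q[k][j] - q[i][k] * p[k][j] for k in range(2))
              for j in range(2)] for i in range(2)]
    return matvec(omega, sigma(x, y))

x0, y0 = 0.4, 1.1
lhs, rhs = commutator(x0, y0), curvature_on_sigma(x0, y0)
max_error = max(abs(lhs[i] - rhs[i]) for i in range(2))
print(max_error)  # only finite-difference error should remain
```

All the derivatives of `sigma` cancel in the commutator, which is the numerical shadow of the fact that $R(X, Y)$ acts on sections by a matrix rather than by a differential operator.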

Recall that the interior product operator is function linear. It follows immediately
from (7.33) and (7.34) that the curvature operator is function linear as well.
Specifically, for any functions $f$, $g$, and $h$ and vector fields $X$, $Y$, and $Z$,

$$R(fX, gY)hZ = fgh\,R(X, Y)Z. \tag{7.42}$$

We are therefore justified in referring to $R(X, Y)$ as the curvature tensor.

EXERCISE 7.4 Prove (7.42) directly using the expression (7.36) together with the 
properties of the covariant derivative operator. (The point of this exercise is just to 
convince some doubters that an abstract definition can sometimes be more useful 
than a concrete one.) 


Additional exercises 

7.5 The unit tangent bundle of the 2-sphere Show that the bundle space of the
unit tangent bundle of the 2-sphere $S^2$ is homeomorphic to $SO(3)$. Remark:
It is actually diffeomorphic, but you need not show this. Hint: As usual, view
$S^2$ as a submanifold of $\mathbb{R}^3$. Let $x \in S^2$ and let $y \in T_x S^2$ be a unit tangent
vector. Consider $z = x \times y$, where $\times$ is the ordinary cross product.

7.6 Parallelizability of the 3-sphere Show that the tangent bundle of the
3-sphere $S^3$ is trivial by finding three linearly independent nowhere vanishing
tangent vector fields. Hint: Define $S^3$, as usual, by

$$\{(x^1, x^2, x^3, x^4) \in \mathbb{R}^4 : (x^1)^2 + (x^2)^2 + (x^3)^2 + (x^4)^2 = 1\}.$$

Identify a tangent vector field with its components in the standard basis $\partial/\partial x^i$.
Show that $(-x^2, x^1, -x^4, x^3)$, $(-x^3, x^4, x^1, -x^2)$, and $(-x^4, -x^3, x^2, x^1)$ do
the trick.

7.7 Direct sum of vector bundles Let $E \to M$ and $F \to M$ be two vector
bundles over $M$. The Whitney sum $E \oplus F \to M$ is the vector bundle whose
fibers are the direct sums of the individual fibers, $(E \oplus F)_p = E_p \oplus F_p$. Show
that this is indeed a vector bundle, and find the transition functions.

7.8 Affine connections Let $D$ and $\tilde{D}$ be two connections on a vector bundle
$E$ with connection matrices $\omega$ and $\tilde{\omega}$, respectively. Let $D_t$ be an affine
combination of $D$ and $\tilde{D}$, so that

$$D_t = (1 - t)D + t\tilde{D}, \qquad t \in \mathbb{R}.$$

Show that $D_t$ is again a connection, with connection matrix

$$\omega_t = \omega + t\eta,$$

where $\eta := \tilde{\omega} - \omega$. This exercise explains why $D$ is often referred to as an
“affine connection” (although this terminology is sometimes reserved for a
connection on the tangent bundle).

7.9 Characteristic classes Let $E$ be a vector bundle over a manifold $M$ with
curvature form $\Omega$.

(a) Show that $\operatorname{tr} \Omega^k$ is invariant under a frame change, where⁹

$$\Omega^k := \underbrace{\Omega \wedge \cdots \wedge \Omega}_{k\ \text{times}}$$

and “tr” denotes the trace. It follows that $b_k := \operatorname{tr} \Omega^k$ is a globally defined
$2k$-form.

(b) Show that $b_k$ is closed. Hint: You will need to use the Bianchi identity,
the properties of the trace, and the commutation properties of forms.

(c) Because $b_k$ is closed, it defines a cohomology class of the vector bundle
$E$ over $M$.¹⁰ Show that $b_k$ is an invariant of $E$ by showing that it
does not depend on the connection used to define it. Hint: In the notation
of Section 7.5 let $\Omega$, $\tilde{\Omega}$, and $\Omega_t$ be the curvature forms associated
to the connections $\omega$, $\tilde{\omega}$, and $\omega_t$, respectively. We want to show that $\operatorname{tr} \tilde{\Omega}^k$
and $\operatorname{tr} \Omega^k$ differ by an exact form. So, show that $\alpha := \operatorname{tr}(\eta \wedge \Omega_t^{k-1})$ is a
globally defined form on $M$ and that $d(\operatorname{tr} \Omega_t^k)/dt = k\,d\alpha$. Integrate this
to obtain the desired conclusion.

9 Written out explicitly, we have

$$(\Omega^k)^i{}_j = \Omega^i{}_{i_1} \wedge \Omega^{i_1}{}_{i_2} \wedge \cdots \wedge \Omega^{i_{k-1}}{}_j.$$

10 These cohomology classes are called characteristic classes, and they play an important role in
geometry and topology. (See Section 9.3 below.) On a complex vector bundle these classes are
related, by Newton’s identities, to the celebrated Chern classes of $M$ (see [18] or [64], Chapter 5).

7.10 Gauge theory The nucleus of an atom consists of protons and neutrons.
Experimentally one observes that, aside from the obvious difference of their
electric charges, protons and neutrons behave pretty much alike inside a
nucleus. For this reason, Werner Heisenberg proposed that the two types
of particles could be thought of as two faces of one type of particle called
a nucleon. By analogy with spin-1/2 particles, which can be either up or
down relative to some axis, Eugene Wigner suggested that the nucleon could
be thought of as possessing a new kind of property called isospin and that
the two isospin states are related by a unitary transformation in a two-dimensional
complex vector space called isospin space. To account for the
observations, the law governing the interactions of two nucleons must therefore
be form invariant, or covariant, under the action of the Lie group $SU(2)$,
meaning that the law looks the same regardless of how the isospin state is
transformed.

But one person’s proton is another person’s neutron. To allow for the possibility
that different observers might disagree on the identity of a given
nucleon, we permit the symmetry to be local, meaning that we are free to
transform the nucleon state by different group elements at different points.
To retain the idea of continuity (actually, differentiability) we demand that the
group elements vary smoothly over the manifold. In other words, we gauge
the $SU(2)$ symmetry.¹¹

11 In the period 1918-19 Hermann Weyl wrote three papers on the possibility of local scale invariance
in physics. In the last of these he called his idea Eichinvarianz. This was initially translated
as “calibration invariance” and later as “gauge invariance”, probably because the English word
“gauge” is employed to describe the different scales of such things as wires and railroad tracks.

As you may have surmised by now, all this is intimately related to the
theory of vector bundles. The nucleon field is modeled by a section of a
vector bundle over a four-dimensional Lorentzian manifold (see Chapter 8),
where the fibers are two-dimensional (complex) vector spaces, each of which
carries the same representation of $SU(2)$. As with all laws of physics, a
differential equation governs the time evolution of the nucleon field, and
the requirement that it be form invariant under local group transformations
means that we must use covariant derivatives instead of ordinary
ones. As we have seen, the introduction of a covariant derivative leads
to the appearance of a connection form (traditionally denoted by the letter
“A”). This form is called a gauge potential, and it turns out here
to have a physical interpretation as the field that mediates the interaction
between two proximate nucleons. It is now understood that nucleons are not
fundamental particles but are instead composed of quarks. Generalizing the
construction above leads to a gauge theory in which the matter fields are
quarks and the gauge fields are gluons. The gauge group in that case is $SU(3)$.
Other gauge theories describe other fields.

Formally, we may define a gauge field theory to be a vector bundle
$(E, M, V, \pi)$ over a smooth manifold $M$, where $V$ is a vector space carrying
a representation of a Lie group $G$ (called the gauge group). Sections
of $(E, M, V, \pi)$ are called matter fields. A connection on the bundle gives
rise to local connection forms $A$, called (local) gauge potentials,¹² obeying
(7.10), (7.22), and (7.27), namely

$$De = eA, \qquad e' = eg, \qquad \text{and} \qquad A' = g^{-1} dg + g^{-1} A g. \tag{7.43}$$

The last two equations are referred to as gauge transformations in this
context.¹³

12 There are at least three different ways to view gauge potentials. The first two involve something
called a principal bundle, which is beyond the scope of this book. (For details see [7] or [69].) The
last approach, in terms of local connection forms, given here, is probably the easiest to understand.
Mathematicians generally prefer the principal bundle approach, because it gives a very geometric
way of understanding the meaning of a connection, while physicists generally prefer the local,
more analytic, approach. Fortunately, all the approaches are ultimately equivalent.

13 We are following traditional physics notation, so there is some unavoidable sloppiness. For example,
we really ought to write $e' = e\rho(g)$, where $\rho$ is the representation of $G$ on $V$. But fixing
the representation means that $G$ can be thought of as a matrix Lie group, so often one conflates
the group element $g$ with the matrix $\rho(g)$ representing it. Note that $g^{-1} dg$ is a Maurer-Cartan
form that naturally lives in the Lie algebra $\mathfrak{g}$ of $G$. Also, we may interpret the last term in (7.43)
as $\mathrm{Ad}\,g(A)$, the adjoint action of the group on its Lie algebra. In this way we see that $A'$, and
therefore $A$, can be thought of as a Lie-algebra-valued 1-form.

(a) Let $\Psi = e\psi = \sum_a e_a \psi^a$ be a matter field. A gauge transformation is
just a local change of basis, so it leaves $\Psi$ invariant. From $\Psi' = e'\psi' =
e\psi = \Psi$ and $e' = eg$ we get $\psi' = g^{-1}\psi$. This is what a physicist calls
a gauge transformation of the matter field. Continuing in (awful) physics
notation, (7.14) reads

$$D\psi^a = d\psi^a + A^a{}_b \psi^b, \tag{7.44}$$

where $e_a (D\psi^a) := e_a (D\Psi)^a = D\Psi$. Given local coordinates $\{x^i\}$ on $M$
we have $A^a{}_b = A_i{}^a{}_b\, dx^i$. Defining $(A_i)^a{}_b := A_i{}^a{}_b$ and taking the inner
product of (7.44) with $\partial/\partial x^i$ gives

$$D_i \psi = \partial_i \psi + A_i \psi, \tag{7.45}$$

where $D_i := i_{\partial/\partial x^i} D$. Show that

$$D_i (g\psi) = g D'_i \psi, \tag{7.46}$$

where $D'_i$ is defined using $A'$. This equation is the motivation for calling
$D_i$ a gauge covariant derivative.

(b) The field strength $F$ is the curvature of the connection $A$. Show that the
(matrix-valued) components of $F$ in local coordinates are

$$F_{ij} = \partial_i A_j - \partial_j A_i + [A_i, A_j], \tag{7.47}$$

where $F = \tfrac{1}{2} F_{ij}\, dx^i \wedge dx^j$ and the bracket $[\cdot, \cdot]$ is defined by

$$[A, B] = AB - BA.$$

Show that (7.47) can also be written as

$$F_{ij} = [D_i, D_j]. \tag{7.48}$$

(c) Show that the Bianchi identity (7.20) is equivalent to the Jacobi identity

$$[D_i, [D_j, D_k]] + [D_j, [D_k, D_i]] + [D_k, [D_i, D_j]] = 0, \tag{7.49}$$

or, equivalently,

$$[D_i, F_{jk}] + [D_j, F_{ki}] + [D_k, F_{ij}] = 0. \tag{7.50}$$

Remark: Although we do not do so here (see instead [6] or [7]), one can
define a covariant analogue of the ordinary differential $d$, often denoted
$d_A$, in terms of which the Bianchi identity (7.50) can be written

$$d_A F = 0. \tag{7.51}$$

Moreover, when sources are present we also have

$$d_A {\star F} = {\star J}. \tag{7.52}$$

The resulting equations (7.51) and (7.52) are called the Yang-Mills
equations. They are a nonabelian generalization of Maxwell’s equations.
Une géométrie ne peut pas être plus vraie qu’une autre; elle peut seulement
être plus commode. (One geometry cannot be more true than
another; it can only be more convenient.)

Henri Poincaré, La science et l’hypothèse

As we have seen, a differentiable or smooth manifold M is just a topological space 
on which we have the ability to differentiate stuff. This is adequate if one is inter- 
ested only in differential topology, but to do more we need to introduce additional 
structure. This additional structure is a geometry. Basically, we want to be able 
to measure distances on M as well as determine the angles between vectors. Of 
course, the vectors don’t live on M - they live on the tangent spaces to M. What 
we need is an inner product on the tangent spaces. But we want to be able to 
differentiate things, so we want the inner product to vary smoothly. 

This leads us to the following definition. A smooth inner product or metric
$g$ on $M$ is just a smooth map $p \mapsto g_p$, where $g_p$ is an inner product on $T_p M$.
A geometric manifold $(M, g)$ is a manifold $M$ equipped with a smooth inner
product $g$. Two geometric manifolds $(M, g)$ and $(N, h)$ are considered equivalent
if they are isometric, meaning there exists a diffeomorphism $\varphi : M \to N$ with
$\varphi_* g = h$. This amounts to saying that lengths of vectors and angles between pairs
of vectors are preserved under $\varphi$, because

$$g(X, Y)_p = (\varphi_* g)(\varphi_* X, \varphi_* Y)_{\varphi(p)} = h(\varphi_* X, \varphi_* Y)_{\varphi(p)}. \tag{8.1}$$

As a symmetric bilinear form on $T_p M$, the inner product $g_p$ has a definite signature.¹
If the signature is $(+, +, \dots, +)$ then the metric is positive definite, and
$M$ equipped with such a metric is called a Riemannian manifold. If the signature
is anything else (except if it is $(-, -, \dots, -)$) then the metric is indefinite, and $M$
equipped with such a metric is called a pseudo-Riemannian manifold. Of particular
interest for physics are Lorentzian manifolds, which are manifolds equipped
with a metric with signature $(-, +, \dots, +)$ (or equivalently, but less aesthetically,
$(+, -, \dots, -)$). With a metric at our disposal we can do a lot of pretty
geometry.

1 Although the inner product itself will vary smoothly from point to point, its signature cannot change unless the
inner product becomes degenerate at some point. This is a consequence of Sylvester’s law of inertia.

For the lawyers, we should probably clear up one more point. 

Theorem 8.1 Every smooth manifold admits a Riemannian metric. 

Proof Let $\{\rho_i\}$ be a partition of unity subordinate to a locally finite open cover
$\{U_i\}$ on $M$. Let $h$ be the usual Euclidean metric on $\mathbb{R}^n$ and define $g_i = \varphi_i^* h$,
where $\varphi_i$ is the coordinate map on $U_i$. Then $g := \sum_i \rho_i g_i$ is a Riemannian metric
on $M$. □

Remark The analogue of Theorem 8.1 does not hold for arbitrary signatures:
there may be topological obstructions. (Can you see where the proof of Theorem 8.1
breaks down?) In fact, it is known that a compact manifold admits a
Lorentzian metric if and only if its Euler characteristic vanishes. However, any
noncompact manifold admits a Lorentzian metric ([39], p. 40).


8.1 Index gymnastics 

In order to do some honest work, we have to (re-)introduce indices. The reason is 
that we need to manipulate components, and components require a definite frame 
field. We begin by discussing the two types of frame field on a manifold. 


8.1.1 Holonomic and nonholonomic frames 

A holonomic or coordinate frame field is one that is derived from a system of
local coordinates. Specifically, if $\{x^1, \dots, x^n\}$ are coordinates on a patch $U$ then

$$e_i = \partial_i := \frac{\partial}{\partial x^i} \tag{8.2}$$

is called a (local) coordinate frame on $U$. We are already familiar with how these
work. They are local derivations in the coordinate directions. For example, $\partial_1$ is a
derivative in the direction in which the $x^1$ coordinate is increasing. Any vector field
$X$ can be written $X = X^i \partial_i$, where the functions $X^i$ are the components of $X$ with
respect to the coordinate frame.

A nonholonomic or noncoordinate frame field is one that cannot be derived
from a local system of coordinates. Such fields are usually just written $\{e_a\}$, where
one chooses early Latin letters rather than middle Latin letters to indicate that one
is dealing with a general frame. The vector field $X$ is now written $X = X^a e_a$,
where the functions $X^a$ are the components of $X$ with respect to the given frame.

It is important to have a good working criterion for when a frame is holonomic. 
This is provided by the next theorem. 

Theorem 8.2 A frame field $\{e_a\}$ is locally holonomic (a coordinate frame
field) if and only if

$$[e_a, e_b] = 0 \quad \text{for all } a \text{ and } b. \tag{8.3}$$

Proof If $e_a = \partial_a$ for some local coordinates $\{x^a\}$ then

$$[e_a, e_b] = [\partial_a, \partial_b] = 0, \tag{8.4}$$

because mixed partials commute. Conversely, suppose that we have a frame field
$\{e_a\}$ satisfying the hypothesis of the theorem. Let $\theta^a$ be the dual frame field, so that
$\theta^a(e_b) = \delta^a{}_b$. By (3.123),

$$d\theta^a(e_b, e_c) = e_b\,\theta^a(e_c) - e_c\,\theta^a(e_b) - \theta^a([e_b, e_c]). \tag{8.5}$$

The first two terms vanish because the derivations $e_a$ give zero when acting on
constants, and by hypothesis (and linearity) the last term vanishes. Hence $\theta^a$ is
closed. By the Poincaré lemma $\theta^a$ is locally exact, so there exist functions $x^a$ such
that $\theta^a = dx^a$. Hence $dx^1 \wedge \cdots \wedge dx^n = \theta^1 \wedge \cdots \wedge \theta^n \neq 0$ (because the $\theta^a$ are
linearly independent), so by the inverse function theorem the $x^a$ form a good set of
local coordinates. By duality, $e_a = \partial_a$. □
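
As an illustration of the criterion (8.3), the sketch below (Python, standard library only) evaluates Lie brackets numerically in polar coordinates $(r, \theta)$ on the plane. The frames chosen, and the helper `lie_bracket`, are assumptions made for this example: the coordinate frame $\{\partial_r, \partial_\theta\}$, and the familiar orthonormal frame $e_r = \partial_r$, $e_\theta = (1/r)\,\partial_\theta$. The bracket is computed from the component formula $[X, Y]^i = X^j \partial_j Y^i - Y^j \partial_j X^i$ using central differences.

```python
def lie_bracket(X, Y, p, h=1e-5):
    """Components of [X, Y] at the point p in the coordinate frame.

    X and Y map a point (list of coordinates) to a list of components.
    Uses [X, Y]^i = X^j d_j Y^i - Y^j d_j X^i with central differences.
    """
    n = len(p)

    def d(F, j):
        hi, lo = list(p), list(p)
        hi[j] += h
        lo[j] -= h
        Fh, Fl = F(hi), F(lo)
        return [(Fh[i] - Fl[i]) / (2 * h) for i in range(n)]

    Xp, Yp = X(p), Y(p)
    dX = [d(X, j) for j in range(n)]  # dX[j][i] approximates d_j X^i
    dY = [d(Y, j) for j in range(n)]
    return [sum(Xp[j] * dY[j][i] - Yp[j] * dX[j][i] for j in range(n))
            for i in range(n)]

# Points are p = [r, theta]; components are taken relative to (d_r, d_theta).
def d_r(p):
    return [1.0, 0.0]

def d_theta(p):
    return [0.0, 1.0]

def e_r(p):
    return [1.0, 0.0]

def e_theta(p):
    return [0.0, 1.0 / p[0]]  # (1/r) d_theta

p0 = [2.0, 0.5]
coordinate_bracket = lie_bracket(d_r, d_theta, p0)
orthonormal_bracket = lie_bracket(e_r, e_theta, p0)
print(coordinate_bracket)    # vanishes, as (8.4) requires
print(orthonormal_bracket)   # close to [0, -1/r^2], i.e. -(1/r) e_theta
```

Since $[e_r, e_\theta] = -(1/r)\,e_\theta \neq 0$, Theorem 8.2 says that no coordinate system on the plane has $\{e_r, e_\theta\}$ as its coordinate frame.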


8.1.2 The line element and the metric tensor components 

Now that we know what a tensor field is, we can see that the metric $g$ is one
because it is a smoothly varying bilinear function on the tangent spaces to $M$. In a
local coordinate basis the metric tensor can be written

$$g = g_{ij}\, dx^i \otimes dx^j, \tag{8.6}$$

where the components are given by

$$g_{ij} = g(\partial_i, \partial_j). \tag{8.7}$$

(Note that $g$ is a symmetric tensor field because $g_{ij} = g_{ji}$, which follows from the
symmetry of the inner product.)


2 Theorem 8.2 is closely related to an important theorem known as Frobenius’ theorem; we discuss this in detail
in Appendix F.
Example 8.1 In Cartesian coordinates in Euclidean (respectively, Minkowski)
space, the metric tensor components are $\delta_{ij}$ (respectively, $\eta_{ij}$); cf. (1.48) and
(1.50). For this reason $\delta_{ij}$ (respectively, $\eta_{ij}$) is often sloppily called the Euclidean
(respectively, Minkowski) metric tensor.


In a general basis $\{e_a\}$ and cobasis $\{\theta^a\}$ we have

$$g_{ab} = g(e_a, e_b) \tag{8.8}$$

and

$$g = g_{ab}\, \theta^a \otimes \theta^b. \tag{8.9}$$

Note that the nondegeneracy of the inner product is equivalent to the nonsingularity
of the matrices whose entries are the components of the metric tensor in the various
frames. The frame $\{e_{\hat{a}}\}$ is orthonormal if

$$g_{\hat{a}\hat{b}} = \pm\delta_{\hat{a}\hat{b}}, \tag{8.10}$$

where we adopt the convention that indices with carets denote orthonormal bases
or components. Orthonormal frame fields always exist locally.


Theorem 8.3 In the neighborhood of any point of a geometric manifold (M, g) 
there exists an orthonormal frame field. 


Proof Pick a point $p \in M$ and let $\{x^i\}$ be local coordinates on a neighborhood $U$
of $p$. Then $\{\partial_i\}$ is a frame field on $U$. If $(M, g)$ is Riemannian then Gram-Schmidt
applied to this frame field yields an orthonormal frame field at every point. The only
question is whether this can be done smoothly, and the answer is “yes”. Define the
smooth vector field $e_1 = \partial_1 / \sqrt{|g(\partial_1, \partial_1)|}$. Next, define

$$e'_2 = \partial_2 - \frac{g(e_1, \partial_2)}{g(e_1, e_1)}\, e_1 \qquad \text{and} \qquad e_2 = \frac{e'_2}{\sqrt{|g(e'_2, e'_2)|}},$$

and observe that $e_2$ is also a smooth vector field because it is constructed out of
smooth quantities by smooth processes. Continuing in this way gives a smooth
orthonormal frame field on $U$.

If $(M, g)$ is pseudo-Riemannian, the only problem that one might encounter is
that at the $i$th stage one of the $e'_i$ might have zero length. In that case, just replace $\partial_i$
by $\partial'_i = \sum_j a_i{}^j \partial_j$, where the $a_i{}^j$ are smooth functions and such that $\partial'_i$ does not lie
in the subspace spanned by $\{e_1, \dots, e_{i-1}\}$. By the nondegeneracy of the metric (see
the first proof of Theorem 1.5) some choice of $a_i{}^j$ will ensure that $e'_i$ has nonzero
length. □
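
The Gram-Schmidt step in the Riemannian case can be sketched in a few lines of Python (standard library only). The metric below is a made-up example on the plane, $g_{ij} = \begin{pmatrix} 4 & 2x \\ 2x & 1 + x^2 \end{pmatrix}$, which is positive definite because $g_{11} > 0$ and $\det g = 4$; the function names are likewise illustrative. The procedure is applied pointwise to the coordinate frame $\{\partial_1, \partial_2\}$, with vectors stored as component lists.

```python
import math

def metric(x, y):
    # Hypothetical Riemannian metric components g_ij at the point (x, y).
    return [[4.0, 2.0 * x], [2.0 * x, 1.0 + x * x]]

def inner(g, u, v):
    """g(u, v) = g_ij u^i v^j."""
    return sum(g[i][j] * u[i] * v[j] for i in range(2) for j in range(2))

def gram_schmidt(g, frame):
    """Orthonormalize `frame` with respect to g, following the proof of Theorem 8.3."""
    ortho = []
    for v in frame:
        w = list(v)
        for e in ortho:
            coeff = inner(g, e, v) / inner(g, e, e)
            w = [w[i] - coeff * e[i] for i in range(2)]
        norm = math.sqrt(abs(inner(g, w, w)))  # abs mirrors |g(e', e')| in the text
        ortho.append([wi / norm for wi in w])
    return ortho

g = metric(1.5, -2.0)
e1, e2 = gram_schmidt(g, [[1.0, 0.0], [0.0, 1.0]])
print(e1, e2)
print(inner(g, e1, e1), inner(g, e1, e2), inner(g, e2, e2))
```

For this particular metric the output frame is $e_1 = \tfrac{1}{2}\partial_1$ and $e_2 = -\tfrac{x}{2}\partial_1 + \partial_2$, whose components are visibly smooth functions of the coordinates, as the proof asserts in general.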
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A metric tensor allows us to calculate the distance 5 along any path y : I 
M by adding up all the infinitesimal distances along the path. This is easiest to 
understand if you think of a particle traveling along a curve y(t), where t denotes 
time. Its velocity vector is y{t) so its speed is ds/dt = \y(t)\ the length of the 
velocity vector. The distance it travels in a time t is therefore 

sit) = J ^dt = J \y(t)\dt. (8.11) 

On a curved surface the length of a tangent vector is determined by the metric, so 

$$ s(t) = \int \sqrt{g_{ij}(t)\,\dot\gamma^i(t)\,\dot\gamma^j(t)}\; dt. \tag{8.12} $$


But the length of the path cannot depend on how fast it is traversed, and indeed if
$\gamma(t')$ denotes a curve with the same image as $\gamma(t)$ but a different parameterization
then we have

$$ s(t') = \int \frac{ds}{dt'}\, dt' = \int \frac{ds}{dt}\frac{dt}{dt'}\, dt' = \int \frac{ds}{dt}\, dt = s(t). $$

For this reason we define the line element $ds^2$ by

$$ ds^2 = g = g_{ij}\, dx^i\, dx^j \tag{8.13} $$


(where the tensor product symbol is usually suppressed). Then the distance
between any two points along a curve is defined by pulling everything back to
the domain of the curve:

$$ \int_\gamma ds = \int_I \gamma^* ds = \int_I \sqrt{(\gamma^* g_{ij})(\gamma^* dx^i)(\gamma^* dx^j)} = \int_I \sqrt{g_{ij}(t)\,\dot\gamma^i\,\dot\gamma^j}\; dt. \tag{8.14} $$

You can think of $ds := \sqrt{ds^2}$ as the distance between two infinitesimally
separate points. For example, in three-dimensional Euclidean space $ds =
\sqrt{dx^2 + dy^2 + dz^2}$.
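
The length integral (8.12) and its independence of the parameterization can be checked numerically. The sketch below is illustrative only: it assumes the standard metric $\mathrm{diag}(1, \sin^2\theta)$ on the unit 2-sphere in coordinates $(\theta, \phi)$, uses a simple midpoint rule, and introduces the names `arc_length`, `latitude`, and `fast` purely for this example. The curve is a circle of latitude, traversed once at unit coordinate speed and once with $\phi = t^2$.

```python
import math

def sphere_metric(theta, phi):
    """Components g_ij of the assumed unit-sphere metric in (theta, phi)."""
    return [[1.0, 0.0], [0.0, math.sin(theta) ** 2]]

def arc_length(curve, velocity, metric, t0, t1, steps=10000):
    """Midpoint-rule approximation of the integral of sqrt(|g_ij v^i v^j|) dt."""
    h = (t1 - t0) / steps
    total = 0.0
    for k in range(steps):
        t = t0 + (k + 0.5) * h
        g = metric(*curve(t))
        v = velocity(t)
        speed2 = sum(g[i][j] * v[i] * v[j] for i in range(2) for j in range(2))
        total += math.sqrt(abs(speed2)) * h
    return total

theta0 = math.pi / 3

# Circle of latitude theta = theta0, parameterized by phi = t.
latitude = lambda t: (theta0, t)
latitude_velocity = lambda t: (0.0, 1.0)
length = arc_length(latitude, latitude_velocity, sphere_metric, 0.0, 2 * math.pi)

# Same image, different parameterization: phi = t^2 with t in [0, sqrt(2 pi)].
fast = lambda t: (theta0, t * t)
fast_velocity = lambda t: (0.0, 2.0 * t)
length_fast = arc_length(fast, fast_velocity, sphere_metric, 0.0, math.sqrt(2 * math.pi))
```

Both values should agree with the circumference $2\pi\sin\theta_0$ of the circle of latitude, up to rounding.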


8.1.3 Raising and lowering indices 

The metric is not just a passive object for measuring distances and angles. It also
provides a means of tying together the vector space $V$ and its dual $V^*$ via the
isomorphism $\psi : T_pM \to T_p^*M$ taking $X$ to $g(X, \cdot)$. 3 In other words, to every
vector field $X$ there corresponds a 1-form $g(X, \cdot)$. What does this 1-form do? As we
have seen, we may view a 1-form as a machine that eats vector fields and spits
out numbers (in a smooth way). The 1-form $g(X, \cdot)$ acts on, say, $Y$, and spits out
$g(X, Y)$. 4

3 This is just the Riesz lemma.

The map $\psi$ is both a blessing and a curse. It is a blessing because it allows us
to simplify certain formulae, but it is also a curse because it blurs the distinction
between covariant and contravariant indices, which can sometimes lead to confusion.
To see how it works, consider a coordinate frame. Then, under $\psi$ we have,
from (8.13),

$$ X^j \partial_j = X \mapsto g(X, \cdot) = g_{ij}\, dx^i(X) \otimes dx^j(\cdot) = g_{ij} X^i\, dx^j. \tag{8.15} $$

Next, we define the covariant components $X_j$ of the vector field $X$ by

$$ X_j := g_{ij} X^i, \tag{8.16} $$

in which case (8.15) can be written

$$ X^j \partial_j \mapsto X_j\, dx^j. \tag{8.17} $$

We say (with reference to (8.16)) that the index $i$ has been lowered by the metric.

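The isomorphism $\psi$ induces a unique metric $\tilde g$ on $T^*M$, namely the one that
makes the following diagram commute:

$$
\begin{array}{ccc}
T_pM \times T_pM & \xrightarrow{\;\psi \times \psi\;} & T_p^*M \times T_p^*M \\
{\scriptstyle g(\cdot,\cdot)} \searrow & & \swarrow {\scriptstyle \tilde g(\cdot,\cdot)} \\
 & \mathbb{R} &
\end{array}
$$

In other words, one gets the same answer following the arrows along either path.
If we define

$$ g^{ij} := \tilde g(dx^i, dx^j), \tag{8.18} $$

then the fact that the diagram commutes means that

$$ g_{ij} = g(\partial_i, \partial_j) = \tilde g(g_{ik}\, dx^k, g_{j\ell}\, dx^\ell) = g_{ik}\, g_{j\ell}\, g^{k\ell}. \tag{8.19} $$

Using the symmetry of the metric we can rewrite (8.19) as

$$ g_{ij} = g_{ik}\, g^{k\ell}\, g_{\ell j}. \tag{8.20} $$

Viewing the components of $g$ as the entries of a matrix, (8.20) shows that $g^{ij}$ is the
matrix inverse to $g_{ij}$ because this relation can only hold if

$$ g_{ik}\, g^{k\ell} = \delta_i^\ell \quad\text{and}\quad g^{k\ell}\, g_{\ell j} = \delta^k_j. \tag{8.21} $$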


4 Because the metric is symmetric, we could have written the map as $X \mapsto g(\cdot, X)$; it makes no difference here
although it does make a difference for complex manifolds equipped with a Hermitian inner product.



This, in turn, makes it possible to use the inverse metric to raise indices, undoing
what was done by (8.16). In particular, we have

$$ X^i = g^{ij} X_j \tag{8.22} $$

because

$$ g^{ij} X_j = g^{ij} g_{jk} X^k = \delta^i_k X^k = X^i. \tag{8.23} $$

Similarly, we can raise and lower indices on any tensor using the metric in this
way. For example,

$$ T^{ij}{}_k = g^{i\ell} g^{jm}\, T_{\ell m k} \quad\text{and}\quad T_{ijk} = g_{im}\, g_{jn}\, g_{kp}\, T^{mnp}. \tag{8.24} $$

It is worth emphasizing that, whereas we can always change the vertical positions
of the indices in this way by using the metric, the horizontal positions are much
less flexible. In fact, a statement like $T_{ij} = T_{ji}$ is only possible if the tensor has a
special property (namely symmetry).
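
As a small numerical illustration of (8.16), (8.22), and (8.23), one can lower an index with a metric and raise it again with the inverse metric. This is a sketch only: the 2 x 2 metric components and the vector below are arbitrary assumed values, and the helper names `lower_index`, `raise_index`, and `inverse_2x2` are introduced just for this example.

```python
def lower_index(g, X):
    """X_j = g_ij X^i, as in (8.16)."""
    n = len(X)
    return [sum(g[i][j] * X[i] for i in range(n)) for j in range(n)]

def inverse_2x2(g):
    """Inverse metric g^ij for a 2 x 2 matrix of components g_ij."""
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    return [[g[1][1] / det, -g[0][1] / det],
            [-g[1][0] / det, g[0][0] / det]]

def raise_index(g_inv, X_low):
    """X^i = g^ij X_j, as in (8.22)."""
    n = len(X_low)
    return [sum(g_inv[i][j] * X_low[j] for j in range(n)) for i in range(n)]

# Assumed sample data: a symmetric, nondegenerate metric and a vector.
g = [[2.0, 1.0], [1.0, 3.0]]
X = [1.0, -2.0]

X_low = lower_index(g, X)                     # covariant components X_j
X_back = raise_index(inverse_2x2(g), X_low)   # should reproduce X^i
```

Here the covariant components differ from the contravariant ones because the metric is not the identity, while raising the index recovers the original vector, as (8.23) asserts.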


8.2 The Levi-Civita connection 
8.2.1 The Koszul formula 

Let $M$ be a manifold. As we saw in Section 7.2, any covariant derivative $\nabla$ on $TM$
satisfies three basic properties (C1)-(C3). By demanding a few more natural compatibility
properties, (C4)-(C6), we can extend the connection on $TM$ to tensor
product bundles of $TM$ and $T^*M$. In general there are infinitely many covariant
derivatives on $TM$. To narrow down the possibilities, one usually imposes
some additional restrictions on $\nabla$. It is not obvious at this stage why one should
impose the following conditions but they turn out to be useful in many applications,
especially Einstein’s theory of general relativity.

The first additional condition has to do with something called torsion, which is
a generalization to manifolds of the twisting of curves in $\mathbb{R}^3$. Define a map

$$ \tau : \Gamma(TM) \times \Gamma(TM) \to \Gamma(TM) $$

given by

$$ \tau(X, Y) = \nabla_X Y - \nabla_Y X - [X, Y]; \tag{8.25} $$

$\tau$ is called the torsion tensor.


EXERCISE 8.1 Verify that $\tau$ is indeed a tensor by verifying that it is function linear
in both arguments.


5 The idea of torsion is subtle and has engendered much confusion. For a discussion, see e.g. [40].

We now insist that this tensor vanish everywhere, so we add the condition of
(C7) (torsion-freeness) $\tau(X, Y) = 0$.

The second additional condition, whose significance becomes apparent later, is
(C8) (metric compatibility) $X g(Y, Z) = g(\nabla_X Y, Z) + g(Y, \nabla_X Z)$.

Theorem 8.4 There is a unique connection on $TM$, satisfying (C1)-(C3), (C7),
and (C8), called the Levi-Civita connection (or, if the metric is positive definite,
the Riemannian connection).

First we prove a little lemma. 

Lemma 8.5 (Koszul) A connection satisfying (C1)-(C3), (C7), and (C8) obeys
the Koszul formula,

$$
\begin{aligned}
2g(\nabla_X Y, Z) ={}& X g(Y, Z) + Y g(X, Z) - Z g(X, Y) \\
& + g([X, Y], Z) - g([X, Z], Y) - g([Y, Z], X).
\end{aligned}
\tag{8.26}
$$

Proof By metric compatibility

$$
\begin{aligned}
X g(Y, Z) &= g(\nabla_X Y, Z) + g(Y, \nabla_X Z), \\
Y g(X, Z) &= g(\nabla_Y X, Z) + g(X, \nabla_Y Z), \\
Z g(X, Y) &= g(\nabla_Z X, Y) + g(X, \nabla_Z Y),
\end{aligned}
$$

so the right-hand side of (8.26) becomes

$$
\begin{aligned}
& g(\nabla_X Y + \nabla_Y X + [X, Y], Z) \\
& + g(\nabla_X Z - \nabla_Z X - [X, Z], Y) \\
& + g(\nabla_Y Z - \nabla_Z Y - [Y, Z], X).
\end{aligned}
$$

But this equals $2g(\nabla_X Y, Z)$ by torsion-freeness. □

Proof of Theorem 8.4 To prove the theorem, we define $\nabla$ by the Koszul formula.
We must then show that it exists (i.e., that the operator $\nabla$ so defined is indeed a
connection) and is unique. Uniqueness is obvious, because $Z$ is arbitrary and the
metric is nondegenerate, so any other covariant derivative $\nabla'$ satisfying (8.26)
would have to equal $\nabla$. To prove existence, we must show that $\nabla$ satisfies axioms
(C1)-(C3), (C7), and (C8). Actually, we need only show that it satisfies axioms
(C1)-(C3) because if it does then, by construction, since it satisfies (8.26) it must
also be torsion free and metric compatible.

The linearity of $\nabla$ is obvious from (8.26), because both the Lie bracket and
the metric are bilinear. So (C1) holds. To prove (C2) it suffices to prove function
linearity in the form $\nabla_{fX} Y = f \nabla_X Y$, because the same argument as before shows
that $\nabla_{X+Y} Z = \nabla_X Z + \nabla_Y Z$. By (8.26) we have

$$
\begin{aligned}
2g(\nabla_{fX} Y, Z) ={}& f X g(Y, Z) + Y g(fX, Z) - Z g(fX, Y) \\
& + g([fX, Y], Z) - g([fX, Z], Y) - g([Y, Z], fX) \\
={}& f X g(Y, Z) + Y[f g(X, Z)] - Z[f g(X, Y)] \\
& + g(fXY - (Yf)X - fYX, Z) \\
& - g(fXZ - (Zf)X - fZX, Y) - f g([Y, Z], X) \\
={}& f X g(Y, Z) + (Yf) g(X, Z) + f Y g(X, Z) \\
& - (Zf) g(X, Y) - f Z g(X, Y) \\
& + f g([X, Y], Z) - (Yf) g(X, Z) \\
& - f g([X, Z], Y) + (Zf) g(X, Y) - f g([Y, Z], X) \\
={}& 2 f g(\nabla_X Y, Z).
\end{aligned}
$$

Property (C3) can be proved similarly; this is left to the reader. □

EXERCISE 8.2 Let $\{e_a\}$ be a frame field and let $\{\theta^a\}$ be its dual frame field on $M$.
Show that, in terms of these bases, the torsion tensor can be written

$$ \tau(e_b, e_c) = \sum_a \tau^a(e_b, e_c)\, e_a, \tag{8.27} $$

where the $\tau^a$ are 2-forms given by

$$ \tau^a = d\theta^a + \omega^a{}_b \wedge \theta^b. \tag{8.28} $$

In a more compact notation one writes

$$ \tau = d\theta + \omega \wedge \theta, \tag{8.29} $$

where $\tau$ is called the torsion form of the connection, relative to the basis $\theta$. 6 Clearly
the torsion tensor and the torsion form embody the same information. Remark: One
sometimes sees the even shorter shorthand notation

$$ \tau = D\theta, \tag{8.30} $$

where the action of the connection $D$ on 1-forms is defined as follows:

$$ D\lambda := d\lambda + \omega \wedge \lambda. \tag{8.31} $$

In this notation the curvature form becomes

$$ \Omega = D\omega. \tag{8.32} $$

However, this notation can be a bit tricky so we shall eschew its use. If you’re curious
about how it works, look at [7], pp. 250-252.


6 Strictly speaking, $\tau$ is not a 2-form but, rather, a collection of 2-forms $\tau^a$.

8.2.2 The Christoffel symbols and structure functions


Let $\{e_a\}$ be an arbitrary frame field and $\{\theta^a\}$ the dual frame field, and let $\nabla$ be the
Levi-Civita connection. We define the Christoffel symbols $\Gamma^c{}_{ab}$ by the formula

$$ \nabla_{e_a} e_b = \Gamma^c{}_{ab}\, e_c. \tag{8.33} $$

The Christoffel symbols are intimately related to the connection forms. We have

$$ \nabla_{e_a} e_b = i_{e_a} D e_b = i_{e_a}(e_c \otimes \omega^c{}_b) = \omega^c{}_b(e_a)\, e_c, \tag{8.34} $$

so, comparing (8.33) and (8.34), we get

$$ \Gamma^c{}_{ab} = \omega^c{}_b(e_a) \quad\text{or}\quad \omega^c{}_b = \Gamma^c{}_{ab}\, \theta^a. \tag{8.35} $$

The Christoffel symbols play an important role in geometry, because they tell you
how vectors twist and turn as you move about the manifold.

EXERCISE 8.3 Show that

$$ \nabla_{e_a} \theta^c = -\Gamma^c{}_{ab}\, \theta^b. \tag{8.36} $$

We now derive an explicit expression for the Christoffel symbols in terms of
simpler quantities. Associated to any frame field $\{e_a\}$ are its structure functions
$c^c{}_{ab}$, defined by

$$ [e_a, e_b] = c^c{}_{ab}\, e_c, \tag{8.37} $$

where the expression on the left is the usual Lie bracket of vector fields.

Theorem 8.6 The Christoffel symbols are determined uniquely by the metric
tensor and the structure functions.

Proof Let $X = e_a$, $Y = e_b$, and $Z = e_c$ and apply the Koszul formula (8.26)
to get

$$
\begin{aligned}
2g(\nabla_{e_a} e_b, e_c) ={}& e_a g(e_b, e_c) + e_b g(e_a, e_c) - e_c g(e_a, e_b) \\
& + g([e_a, e_b], e_c) - g([e_a, e_c], e_b) - g([e_b, e_c], e_a).
\end{aligned}
$$

In terms of components this reads (using the bilinearity of the metric)

$$ 2\Gamma^d{}_{ab}\, g_{dc} = e_a g_{bc} + e_b g_{ac} - e_c g_{ab} + c^d{}_{ab}\, g_{dc} - c^d{}_{ac}\, g_{db} - c^d{}_{bc}\, g_{da}. $$

This equation is the result we seek. It can be written more compactly by using the
metric tensor to lower all the indices (so that $\Gamma_{cab} := g_{cd}\Gamma^d{}_{ab}$ and $c_{cab} := g_{cd}\, c^d{}_{ab}$):

$$ \Gamma_{cab} = \tfrac{1}{2}\left( e_a g_{bc} + e_b g_{ac} - e_c g_{ab} + c_{cab} - c_{bac} - c_{abc} \right). \tag{8.38} $$

□

As an important special case of the theorem, we observe by Theorem 8.2 that
the structure functions vanish in a coordinate (holonomic) basis. In that case (8.38)
simplifies to give

$$ \Gamma_{kij} = \tfrac{1}{2}\left( \partial_i g_{jk} + \partial_j g_{ik} - \partial_k g_{ij} \right) = \tfrac{1}{2}\left( g_{jk,i} + g_{ik,j} - g_{ij,k} \right). \tag{8.39} $$

Notice a handy feature of the Christoffel symbols in a coordinate basis, namely
that they are symmetric in the last two indices:

$$ \Gamma^k{}_{ij} = \Gamma^k{}_{ji}. \tag{8.40} $$

One word of warning. The Christoffel symbols look like the components of a tensor,
but they are not. That is, they would appear to be tensor components because they
have all those indices, but they fail to satisfy the correct transformation property
(3.53). This fact follows from (7.27) and (8.35), but it is also instructive to see it
directly.


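EXERCISE 8.4 Starting from the definition

$$ \nabla_{\partial_i} \partial_j = \Gamma^k{}_{ij}\, \partial_k \tag{8.41} $$

of the Christoffel symbols in a coordinate basis (with $\partial_i := \partial/\partial x^i$), verify that the
$\Gamma^k{}_{ij}$ are not the components of any tensor field by showing that, under a coordinate
transformation $x^i \to y^{i'}$,

$$ \Gamma^{k'}{}_{i'j'} = \frac{\partial y^{k'}}{\partial x^k} \frac{\partial x^i}{\partial y^{i'}} \frac{\partial x^j}{\partial y^{j'}}\, \Gamma^k{}_{ij} + \frac{\partial y^{k'}}{\partial x^\ell} \frac{\partial^2 x^\ell}{\partial y^{i'} \partial y^{j'}}. \tag{8.42} $$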


EXERCISE 8.5 Consider the unit 2-sphere $S^2$ equipped with the standard line
element

$$ ds^2 = d\theta^2 + \sin^2\theta\, d\phi^2, \tag{8.43} $$

where $\theta$ and $\phi$ are the usual spherical polar coordinates in $\mathbb{R}^3$. (As previously noted,
these coordinates are not well defined at the poles, so their use implies that we are
standing somewhere else on the sphere.) Show that the only nonzero Christoffel
symbols for the 2-sphere metric are

$$ \Gamma^\theta{}_{\phi\phi} = -\sin\theta \cos\theta \quad\text{and}\quad \Gamma^\phi{}_{\theta\phi} = \Gamma^\phi{}_{\phi\theta} = \cot\theta. $$

Remark: For a derivation of (8.43), see Exercise 8.47.
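
The coordinate-basis formula (8.39) can be evaluated numerically as a sanity check on hand computations such as the one requested in Exercise 8.5. The sketch below is not a solution of the exercise, only a check under stated assumptions: it handles two-dimensional metrics only, approximates the partial derivatives of $g_{ij}$ by central differences with an assumed step `h`, and uses the illustrative names `sphere_metric` and `christoffel`.

```python
import math

def sphere_metric(x):
    """Components g_ij of (8.43) at the point x = (theta, phi)."""
    theta, _phi = x
    return [[1.0, 0.0], [0.0, math.sin(theta) ** 2]]

def christoffel(metric, x, h=1e-5):
    """Return gamma[k][i][j] = Gamma^k_{ij} for a 2D metric, using (8.39)."""
    n = 2
    g = metric(x)
    # dg[k][i][j] approximates the partial derivative of g_ij along x^k.
    dg = []
    for k in range(n):
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        gp, gm = metric(xp), metric(xm)
        dg.append([[(gp[i][j] - gm[i][j]) / (2 * h) for j in range(n)]
                   for i in range(n)])
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    g_inv = [[g[1][1] / det, -g[0][1] / det],
             [-g[1][0] / det, g[0][0] / det]]
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for k in range(n):
        for i in range(n):
            for j in range(n):
                # Raise the first index of Gamma_{lij} with the inverse metric.
                gamma[k][i][j] = sum(
                    0.5 * g_inv[k][l] * (dg[i][j][l] + dg[j][i][l] - dg[l][i][j])
                    for l in range(n))
    return gamma

theta = 1.0
gamma = christoffel(sphere_metric, [theta, 0.3])  # index 0 = theta, 1 = phi
```

At `theta = 1.0` the entries `gamma[0][1][1]` and `gamma[1][0][1]` can be compared with $-\sin\theta\cos\theta$ and $\cot\theta$; the remaining entries should vanish up to discretization error.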


EXERCISE 8.6 Let $\{e_{\hat a}\}$ be an orthonormal frame field on some neighborhood $U$
of $p \in M$, and let $\omega$ and $\Omega$ be the connection and curvature matrices, respectively,
relative to this frame. Show that both matrices are skew symmetric when their indices
are lowered with the metric. That is, show that $\omega_{\hat a \hat b} := g_{\hat a \hat c}\, \omega^{\hat c}{}_{\hat b}$ satisfies $\omega_{\hat a \hat b} = -\omega_{\hat b \hat a}$,
with a similar result holding for $\Omega$. Hint: Use metric compatibility. Remark: This
result holds irrespective of the signature of the metric, but a similar statement is
false if $M$ is pseudo-Riemannian and the indices are not both lowered. For example,
$\omega^{\hat a}{}_{\hat b} \neq -\omega^{\hat b}{}_{\hat a}$ in general; see Exercise 8.40. However, if $M$ is Riemannian (so that the
metric is positive definite) then we have both $\omega^{\hat a}{}_{\hat b} = -\omega^{\hat b}{}_{\hat a}$ and $\Omega^{\hat a}{}_{\hat b} = -\Omega^{\hat b}{}_{\hat a}$. These
last equations refer to individual components and, as should be clear, are not meant
to imply a tensor equality, so in the Riemannian case it is better simply to write
$\omega^T = -\omega$ and $\Omega^T = -\Omega$. Note that all these results apply only to orthonormal
frames.

EXERCISE 8.7 As before, let $\{e_{\hat a}\}$ be a local orthonormal frame field around $p \in M$.
Assume the metric to be positive definite. By virtue of Exercise 8.6, $\omega$ is a skew
symmetric matrix (of 1-forms). Show that if $\omega = A^{-1} dA$ for some matrix $A$ of
smooth functions with $A(p) = I$ (the identity matrix) then $A$ must be an orthogonal
matrix.


8.3 The Riemann curvature tensor 

Exercise 8.4 shows that the Christoffel symbols are nontensorial and consequently 
appear to be of limited utility. However, they can be combined in a special way to 
form an important tensor, namely the Riemann curvature tensor, an object of great 
utility. 

The Riemann curvature tensor is just the curvature tensor of the Levi-Civita 
connection. The significance of the Riemann curvature tensor is certainly not obvi- 
ous at this stage but, intuitively, it measures how much the manifold curves in 
various directions. Here we discuss a few simple consequences of the definition 
and later we investigate its properties more thoroughly. 

In a general basis $\{e_a\}$ the components of the Riemann curvature tensor are
defined by

$$ R(e_c, e_d)\, e_b = R^a{}_{bcd}\, e_a. \tag{8.44} $$

EXERCISE 8.8 Let $\{e_a\}$, $\{\theta^b\}$ be a pair of dual bases. Starting from (7.34), show that
the curvature 2-forms can be written

$$ \Omega^a{}_b = \tfrac{1}{2} R^a{}_{bcd}\, \theta^c \wedge \theta^d. \tag{8.45} $$

EXERCISE 8.9 Using (7.36) and (8.44) show that, in a coordinate (holonomic)
basis, the components of the Riemann curvature tensor are

$$ R^i{}_{jk\ell} = \Gamma^i{}_{\ell j,k} - \Gamma^i{}_{kj,\ell} + \Gamma^i{}_{km}\Gamma^m{}_{\ell j} - \Gamma^i{}_{\ell m}\Gamma^m{}_{kj}, \tag{8.46} $$

where $\Gamma^i{}_{jk}$ are the Christoffel symbols.

The Riemann curvature tensor satisfies various symmetry properties, some of
which are obvious and some of which are not. They are

$$ R(X, Y)Z = -R(Y, X)Z, \tag{8.47} $$

$$ g(W, R(X, Y)Z) = -g(Z, R(X, Y)W), \tag{8.48} $$

$$ R(X, Y)Z + R(Y, Z)X + R(Z, X)Y = 0, \tag{8.49} $$

and

$$ g(W, R(X, Y)Z) = g(X, R(W, Z)Y). \tag{8.50} $$

Equation (8.47) is clear from (7.36). To prove (8.48) we simplify things by observing
that, by tensoriality, it suffices to prove the result in a local coordinate basis.
In particular, we may assume that all brackets such as $[X, Y]$ vanish. By metric
compatibility

$$ g(W, \nabla_X \nabla_Y Z) = X g(W, \nabla_Y Z) - g(\nabla_X W, \nabla_Y Z), $$

so that

$$
\begin{aligned}
g(W, R(X, Y)Z) ={}& X g(W, \nabla_Y Z) - Y g(W, \nabla_X Z) \\
& - g(\nabla_X W, \nabla_Y Z) + g(\nabla_Y W, \nabla_X Z).
\end{aligned}
$$

Now add this to the same expression with $W$ and $Z$ interchanged. After obvious
cancellations and another appeal to metric compatibility the right-hand side
becomes

$$ XY g(W, Z) - YX g(W, Z) = [X, Y] g(W, Z) = 0. $$

For (8.49), we again assume that the brackets vanish. Expanding the left-hand side
gives

$$ \nabla_X \nabla_Y Z - \nabla_Y \nabla_X Z + \nabla_Y \nabla_Z X - \nabla_Z \nabla_Y X + \nabla_Z \nabla_X Y - \nabla_X \nabla_Z Y. $$

By torsion-freeness (e.g. $\nabla_Y Z = \nabla_Z Y$ when $[Y, Z] = 0$) the terms in this expression all cancel
pairwise. The last property is left as the following exercise.

EXERCISE 8.10 Prove (8.50) using (8.47)-(8.49). Hint: Take the inner product of 
(8.49) with a fixed vector, then add and subtract various permutations of it and use 
the other properties. 

It is useful to translate the symmetry conditions (8.47)-(8.50) into statements
about the (downstairs) components of the Riemann curvature tensor, say in a
coordinate basis. Defining $R_{ijk\ell} := g_{im} R^m{}_{jk\ell}$ we have:

(1) $R_{ijk\ell} = -R_{ij\ell k}$ (antisymmetry in the last two indices);

(2) $R_{ijk\ell} = -R_{jik\ell}$ (antisymmetry in the first two indices);

(3) $R_{i[jk\ell]} = 0$ (total antisymmetry in the last three indices);

(4) $R_{ijk\ell} = R_{k\ell ij}$ (symmetry under the interchange of the first and last pair of
indices).

These follow directly from (8.47)-(8.50) on choosing $W = \partial_i$, $X = \partial_k$, $Y =
\partial_\ell$, and $Z = \partial_j$. The only exception is property (3), which requires the use of
property (1).


EXERCISE 8.11 Prove property (3) above. 


EXERCISE 8.12 By counting the number of constraints, show that the Riemann
tensor has only

$$ \frac{n^2(n^2 - 1)}{12} \tag{8.51} $$

(rather than $n^4$) independent components in $n$ dimensions. Hint: By property (1)
there are only $N := \binom{n}{2} = n(n - 1)/2$ choices for the last two indices, because
there are that many ways of choosing two distinct objects from a list of $n$ objects. By
property (4) the Riemann tensor can be viewed as a symmetric $N \times N$ matrix (which
means that the conditions in property (2) are redundant and can be discarded). Argue
that property (3), after accounting for properties (1) and (4), imposes $\binom{n}{4}$ conditions.
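
The arithmetic behind the hint can be checked mechanically. The sketch below takes the counts stated in the hint as given (a symmetric $N \times N$ matrix with $N = \binom{n}{2}$, minus $\binom{n}{4}$ conditions from property (3)) and compares the result with (8.51) for small $n$; it does not replace the argument that those counts are correct. The function name `riemann_components` is an illustrative choice.

```python
from math import comb

def riemann_components(n):
    """Independent components according to the counting in the hint."""
    pairs = comb(n, 2)                    # N: antisymmetric index pairs
    symmetric = pairs * (pairs + 1) // 2  # symmetric N x N matrix entries
    cyclic = comb(n, 4)                   # conditions from property (3)
    return symmetric - cyclic

# Compare with the closed form n^2 (n^2 - 1) / 12 from (8.51).
counts = {n: riemann_components(n) for n in range(1, 8)}
closed_form = {n: n * n * (n * n - 1) // 12 for n in range(1, 8)}
```

For example, the count gives 1 component in two dimensions, 6 in three, and 20 in four.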


EXERCISE 8.13 Using the standard coordinates on the 2-sphere, show that
$R^\theta{}_{\phi\theta\phi} = \sin^2\theta$ and $R^\phi{}_{\theta\phi\theta} = 1$ and that all the other independent components
vanish. Hint: Use Exercise 8.5 and (8.46).


8.4 More curvature tensors 

From the Riemann curvature tensor we can construct various other curvature tensors
by contraction. In a coordinate basis the components of the Ricci tensor $R_{ij}$
are defined by

$$ R_{ij} := R^k{}_{ikj}. \tag{8.52} $$

EXERCISE 8.14 Show that the Ricci tensor is symmetric. 

EXERCISE 8.15 Compute the components of the Ricci tensor for the 2-sphere in 
standard coordinates. 

EXERCISE 8.16 Define

$$ \mathrm{Ric}(X, Y) = \mathrm{tr}(Z \mapsto R(Z, Y)X). \tag{8.53} $$

Show that Ric is precisely the Ricci tensor. That is, show that $\mathrm{Ric}(\partial/\partial x^i,
\partial/\partial x^j) = R_{ij}$.

The Ricci curvature scalar is

$$ R := g^{ij} R_{ij}. \tag{8.54} $$

Note that this number is the same in all coordinate systems. It is therefore called
a curvature invariant. Note that $R^{ij} R_{ij}$ is also a curvature scalar built from the
Ricci tensor, but it has no standard name. Many other curvature invariants exist.

EXERCISE 8.17 Show that $R = 2$ for the unit 2-sphere.
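
The chain from Christoffel symbols to the Riemann tensor (8.46), the Ricci tensor (8.52), and the curvature scalar can be traced numerically for the unit 2-sphere. The sketch below is a check under stated assumptions rather than a derivation: it takes the Christoffel symbols quoted in Exercise 8.5 as input, approximates their $\theta$-derivatives by central differences with an assumed step `h`, and contracts the Ricci tensor with the inverse of the metric (8.43). The function names `gamma`, `riemann`, and `ricci_scalar` are illustrative.

```python
import math

def gamma(theta):
    """G[i][j][k] = Gamma^i_{jk} on the unit 2-sphere (0 = theta, 1 = phi)."""
    G = [[[0.0, 0.0], [0.0, 0.0]] for _ in range(2)]
    G[0][1][1] = -math.sin(theta) * math.cos(theta)
    G[1][0][1] = G[1][1][0] = math.cos(theta) / math.sin(theta)
    return G

def riemann(theta, h=1e-5):
    """R[i][j][k][l] = R^i_{jkl}, assembled term by term from (8.46)."""
    G, Gp, Gm = gamma(theta), gamma(theta + h), gamma(theta - h)

    def dG(k, i, a, b):
        # Derivative of Gamma^i_{ab} along x^k; nothing depends on phi here.
        return (Gp[i][a][b] - Gm[i][a][b]) / (2 * h) if k == 0 else 0.0

    R = [[[[0.0] * 2 for _ in range(2)] for _ in range(2)] for _ in range(2)]
    for i in range(2):
        for j in range(2):
            for k in range(2):
                for l in range(2):
                    R[i][j][k][l] = (
                        dG(k, i, l, j) - dG(l, i, k, j)
                        + sum(G[i][k][m] * G[m][l][j] - G[i][l][m] * G[m][k][j]
                              for m in range(2)))
    return R

def ricci_scalar(theta):
    """Contract R_ij = R^k_{ikj} with the inverse metric diag(1, 1/sin^2 theta)."""
    R = riemann(theta)
    ricci = [[sum(R[k][i][k][j] for k in range(2)) for j in range(2)]
             for i in range(2)]
    g_inv = [[1.0, 0.0], [0.0, 1.0 / math.sin(theta) ** 2]]
    return sum(g_inv[i][j] * ricci[i][j] for i in range(2) for j in range(2))

scalar_a = ricci_scalar(0.7)
scalar_b = ricci_scalar(1.3)
```

Evaluating at two different values of $\theta$ gives the same number up to discretization error, consistent with the sphere having constant curvature.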

Although this is not a book about general relativity, it would be gross negligence
to have reached this point and yet not to write down Einstein’s equations; for
completeness we will do so here. The components of the Einstein tensor $G_{ij}$ are
defined by

$$ G_{ij} := R_{ij} - \tfrac{1}{2} R\, g_{ij}. \tag{8.55} $$

Einstein’s famous equations of general relativity can then be written (in natural
units) as

$$ G_{ij} = 8\pi T_{ij}, \tag{8.56} $$

where $T_{ij}$ are the components of something called the stress-energy tensor, which
contains all the information about the distribution of mass-energy and momentum
in spacetime.

Einstein’s equations (8.56), together with the constraints imposed on the Einstein
tensor by the Bianchi identity, comprise $n(n - 1)/2$ coupled, nonlinear partial
differential equations for the metric tensor components in terms of the stress-energy
tensor components. Despite their complexity, in all cases in which they can be 
solved, either exactly or approximately, they successfully predict the behavior of 
every observable large scale phenomenon in the universe; they also predict the 
shape of the universe itself. Einstein’s equations are sometimes summarized by 
the (entirely too cavalier) slogan “geometry = mass”, because they predict that 
spacetime is curved by the presence of matter. 

7 Remarks (i) A scalar is an element of the underlying field F, but a scalar field is a coordinate invariant function 
on a manifold. (How could a function not be coordinate invariant? The answer is, if it is a pseudoscalar. See the 
discussion in the footnote indicated after (8.130).) Sometimes people are sloppy and call a scalar field a scalar. 
Mea culpa. (ii) Some authors would say that R is a diffeomorphism invariant, but this is misleading. Recalling
the discussion in the footnote indicated before (3.14), we could indeed say that R is a diffeomorphism invariant 
in the sense that “diffeomorphism” is just the active way of stating “coordinate transformation”. But if we think 
of a diffeomorphism as a smooth deformation of the manifold, it would seem to engender a contradiction. After 
all, a squashed sphere has a different geometry from a standard sphere. But this is the point: when those authors 
say “diffeomorphism invariant” they mean that both the manifold and the metric are changed simultaneously by 
the diffeomorphism. In that case, R is indeed invariant. If we were to just squash the sphere but not change the 
metric accordingly (by, for example, embedding both the standard sphere and the squashed sphere in Euclidean 
space and demanding that they each inherit their respective metrics from the ambient Euclidean metric) then R 
would certainly not be invariant. For this reason, it is best not to call R a diffeomorphism invariant. However, 
certain integrals of curvature scalars are indeed diffeomorphism invariants in the stronger sense because they 
are invariant under arbitrary smooth deformations of the manifold, no matter what metric is used. In other 
words, they are smooth topological invariants. See Theorem 9.6 below. 

Remark Another striking application of the Ricci tensor is to a fundamental
question of topology. A topological space is simply connected if it is path
connected and if every simple closed curve can be contracted to a point while 
remaining in the space. Simple connectivity can be used to distinguish two topo- 
logically distinct manifolds. For example, both the 2-sphere and the 2-torus are 
compact and without boundary, but the 2-sphere is simply connected while the 
2-torus is not. The question then becomes, if a manifold (of any dimension) is 
compact, boundaryless, and simply connected is it a (topological) sphere? 

The famous mathematician Henri Poincaré showed that this is true for two-dimensional
manifolds, and in 1904 he conjectured that it is also true in
three dimensions. 8 For almost one hundred years Poincaré’s notorious conjecture
resisted all attempts to solve it, until Grigory Perelman used Richard Hamilton’s
technique of Ricci flow to construct a proof around 2003. The basic idea of Ricci
flow is to treat the curvature of a manifold as something that flows from one point
to another (akin to the flow of heat) while preserving the original topology. 9 The
hope was that one could then show that a 3-manifold with the desired properties
would flow into a manifold with constant positive curvature (which, given the
other properties, has to be a 3-sphere). The details are extremely delicate, but ultimately
Perelman’s argument was judged successful. For a good popular discussion
of the Poincaré conjecture and its history, see [72] or [83]. For the technical details,
see [63].


8.5 Flat manifolds 

In the next few sections we briefly investigate the significance of curvature. We
begin by trying to characterize spaces that are not curved, so we start with
Euclidean space. Intuitively, Euclidean space is “flat” in all directions. Thus we
would expect the Riemann curvature tensor of Euclidean space to vanish. Fortunately,
it does. The Euclidean metric tensor in Cartesian coordinates has constant
components $g_{ij} = \delta_{ij}$ so, by (8.39), all the Christoffel symbols vanish, whereupon
by (8.46) $R_{ijk\ell} = 0$. But a tensor that vanishes in one coordinate system
vanishes in all coordinate systems, because by construction tensors are coordinate
independent. 10


8 In dimensions four and above, the analogous conjecture (that any compact, boundaryless, simply con- 
nected and homologically trivial manifold is a sphere) was affirmed in the 1960s through the efforts of 
mathematicians such as Smale, Stallings, Wallace, Zeeman, and Freedman. For more about this story, see 
[59]. 

9 The basic equation of Ricci flow is $\partial g_{ij}/\partial t = -2R_{ij}$.

10 The same thing holds for Minkowski space, as well as for other spaces with globally constant metric tensor 
components. To avoid having to discuss spaces of every possible signature we mostly restrict our attention in 
this section to Riemannian spaces. 

By definition, a space whose Riemann curvature tensor vanishes everywhere is
called a flat space. We have just seen that Euclidean space is flat. Is this the 
only flat space there is? Interestingly, the answer is no. For example, you will 
hear people say that the cylinder and the torus are also flat, which sounds absurd 
on the face of it but is true in a certain sense. We really have to be a bit more 
precise. 

The question of whether a geometric manifold is flat depends on its metric and
only indirectly on its topology. A topological space equipped with one metric may
be flat, but equipped with another it may not be. Consider, for example, the torus.
If we think of the torus as a doughnut in $\mathbb{R}^3$ with its metric inherited from the ambient
Euclidean metric, then it is not flat. (See Exercise 8.49.) But, if the torus is defined
as $\mathbb{R}^2/\mathbb{Z}^2$ and equipped with the metric $ds^2 = dx^2 + dy^2$, where $x$ and $y$ are the
standard coordinates defined in Exercise 3.16 (see also Exercise 3.23), then it is
definitely flat. 12

For this reason it would be better to avoid saying that a manifold is or is not flat 
and instead say that a manifold is or is not equipped with a flat metric. In fact, it 
would be best to speak of a geometric manifold (M, g) as being flat or not. But 
everyone speaks of flat manifolds, so we will do so as well, remembering that the 
statement always refers to a manifold with a metric. 

This raises two questions. First, how could a manifold that looks curved actually
be flat? For example, the standard torus looks curved, and in fact is curved.
But the standard right circular cylinder in $\mathbb{R}^3$ looks curved, and yet is flat (see
Exercise H.8). The resolution of this conundrum lies in distinguishing two kinds
of curvatures, intrinsic and extrinsic. The intrinsic curvature of a manifold is the 
curvature that an observer living in the manifold would measure (see Section 8.6 
below for exactly how one might do this), whereas the extrinsic curvature depends 
on how the manifold is embedded in a higher-dimensional space. The cylinder has 
zero intrinsic curvature but nonzero extrinsic curvature. In this chapter we focus 
exclusively on intrinsic curvature, so when we speak of “curvature” we are always 
referring to the intrinsic curvature. But both types are important, and there are deep 
connections between them. For more about this, see Appendix H. 

The second question is, when does a manifold admit a flat metric? For example, 
no matter how hard you try, you cannot put a flat metric on a topological sphere. 
To answer this question one needs the following result, which is of sufficient 
importance that we present it here. 

11 Note that the Riemann tensor is built from the metric via the (torsion free, metric compatible) Levi-Civita 
connection, so our definition of flat space assumes the use of the Levi-Civita connection. More generally one 
may speak of a vector bundle equipped with a flat connection (a connection with zero curvature), but we do 
not do so because we are focused here on geometric manifolds. 

12 This flat torus can only be embedded isometrically in $\mathbb{R}^4$.

Theorem 8.7 A manifold $M$ is flat if and only if every point of $M$ has a neighborhood
isometric to an open subset of Euclidean space. Put another way, $M$ is
flat if and only if every point has a coordinate neighborhood $U$ such that $g_{ij} = \delta_{ij}$
everywhere on $U$. 13

Basically this theorem says that the only way to get a flat manifold is to patch
together undistorted bits of Euclidean space without kinks or bends. How could
you do this and get anything but Euclidean space? We basically gave it away above.
Observe that the flat cylinder may be viewed as the quotient $\mathbb{R}^2/\mathbb{Z}$, while the flat
2-torus was defined as $\mathbb{R}^2/\mathbb{Z}^2$ (both with the induced metric). It turns out that these
examples are typical, in the sense that every flat manifold can be obtained from
Euclidean space by identifying various points appropriately. 14 Thus, flat manifolds
are locally identical to, but globally distinct from, Euclidean space. 15

Remark Because a flat manifold is only locally isometric to Euclidean space, 
one sometimes defines a manifold whose Riemann tensor vanishes to be “locally 
flat” and reserves the word “flat” for Euclidean space itself. We do not use this 
terminology here, because it conflicts with standard usage in general relativity that 
all manifolds are locally flat in the sense that they all admit metrics that are flat to 
first order. (See Appendix D.) 

We have already seen that if the metric tensor has constant components then the
Riemann tensor vanishes, so one part of the proof of Theorem 8.7 is immediate.
The converse, however, is much more involved. We offer two proofs, one here
and one in Section 8.8, because each is instructive. The proof given here uses the
language of differential forms and relies upon the Frobenius theorem. 16 First we
need the following lemma.

Lemma 8.8 Let $\{e_a\}$ be a frame field on a neighborhood $U$ of a point $p$, and
let $\omega$ and $\Omega$ be the corresponding matrices of connection and curvature forms,
respectively. If $\Omega = 0$ everywhere on $U$, there exists a unique matrix $A$ of smooth
functions on $U$ with $A(p) = I$ (the identity matrix) satisfying

$$ \omega = A^{-1} dA. \tag{8.57} $$


13 If the signature is Lorentzian, replace $\delta_{ij}$ by $\eta_{ij}$.

14 Technically, complete flat manifolds are quotients of Euclidean space by the free action of some discrete 
subgroup of the Euclidean group of rigid motions. (These manifolds are of particular importance in the field 
of crystallography.) For more information, see e.g. [91]. 

15 We said earlier that manifolds “look like” Euclidean space near a point, but that only means they are locally
homeomorphic to a piece of $\mathbb{R}^n$; it does not say anything about distances or angles, which are generally
distorted by a homeomorphism, nor does it say anything about global topology.

16 See Appendix F. See also [25], Section 7.4, which we follow here.

Proof Uniqueness is easy. If $\omega = B^{-1} dB$ with $B(p) = I$ then

$$ d(AB^{-1}) = (dA)B^{-1} - AB^{-1}(dB)B^{-1} = (A\omega)B^{-1} - A(\omega B^{-1}) = 0. $$

Hence $AB^{-1} = AB^{-1}(p) = I$, whence $A = B$.

For the existence part, we embed $M$ in an $(n + n^2)$-dimensional manifold $E$,
equipped with $n$ local coordinates $x^1, \dots, x^n$ from $U$ and $n^2$ additional coordinates
$z^i{}_j$ ($1 \le i, j \le n$) chosen to satisfy the constraint $\det Z \neq 0$, where $Z := (z^i{}_j)$. 17
Define an $n \times n$ matrix of 1-forms on $E$ by

$$ \Psi = dZ - Z\omega. $$

Then, as $\Omega = 0$ gives $d\omega = -\omega \wedge \omega$,

$$ d\Psi = -dZ \wedge \omega - Z\, d\omega = -(\Psi + Z\omega) \wedge \omega + Z\omega \wedge \omega = -\Psi \wedge \omega. $$

Therefore, by Frobenius’ theorem, the distribution $\Psi = 0$ is completely integrable,
with integral submanifold $S \subset E$. This means that we can solve the differential
equations $\Psi = 0$ to get $S$. But those equations read

$$ dZ = Z\omega, $$

which can be viewed as differential equations for the $z^i{}_j$ in terms of the $x^i$. By
integrability, there is a matrix $A$ of functions of $x$ such that $Z = A$ solves this
system subject to some initial condition that we may choose to be $A(p) = I$.
Substituting $A$ for $Z$ gives

$$ dA = A\omega \quad\Rightarrow\quad \omega = A^{-1} dA. $$

□

Proof of Theorem 8.7 We want to show that vanishing curvature implies a flat
metric. Let $\{e_a\}$ be an orthonormal frame field on some subset $U$, with dual frame
field $\{\theta^a\}$. (The dual frame field is automatically orthonormal.) By Lemma 8.8 there
exists a matrix $A$ of smooth functions on $U$ with $\omega = A^{-1}\,dA$, which by virtue of
Exercise 8.7 must be orthogonal. Define a new coframe field by

$$\nu = A\theta. \tag{8.58}$$

As $A$ is orthogonal and $\theta$ is orthonormal, $\nu$ is also orthonormal. By (8.57),

$$d\nu = dA \wedge \theta + A\,d\theta = A\omega \wedge \theta - A\omega \wedge \theta = 0, \tag{8.59}$$

where we have used the fact that the torsion form (8.29) vanishes. Hence each
1-form $\nu^i$ is closed, so by the Poincaré lemma there exist functions $x^i$ on $U$ such


17 The cognoscenti will recognize a principal bundle lurking about.
that $\nu^i = dx^i$. It follows immediately that the dual frame field $\partial/\partial x^i$ is also
orthonormal. In particular, $g_{ij} = \delta_{ij}$ everywhere on $U$.18 □


8.6 Parallel transport and geodesics 

Suppose that you were living in a particular manifold. How could you tell whether 
it is flat or curved? It turns out there are several techniques one can use, each of 
which is helpful in different circumstances. They all involve the idea of parallel 
transport. 

Let $I \subset \mathbb{R}$ be an interval, and let $\gamma : I \to M$ be a parameterized curve in $M$
with tangent vector field $X = \gamma_*(d/dt)$. Let $Y$ be any other vector field on $M$. We
say that $Y$ is parallel transported (or parallel translated) along $\gamma$ if $\nabla_X Y = 0$.
Intuitively, $Y$ is parallel transported along $\gamma$ if it remains “as parallel to itself as
possible” along $\gamma$, so that the only change in $Y$ along the curve is that induced by
the curvature of the manifold in which it is situated.19

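It is instructive to examine the equation for parallel transport in local coordinates.
Suppose that $X = X^i \partial_i$ and $Y = Y^j \partial_j$. Then

$$\begin{aligned}
\nabla_X Y &= \nabla_X (Y^j \partial_j) \\
&= X(Y^j)\,\partial_j + Y^j\,\nabla_X \partial_j \\
&= X(Y^j)\,\partial_j + Y^j X^i\,\nabla_{\partial_i} \partial_j \\
&= \left[ X(Y^k) + \Gamma^k{}_{ij} X^i Y^j \right] \partial_k.
\end{aligned}$$

Hence $\nabla_X Y = 0$, provided that

$$X(Y^k) + \Gamma^k{}_{ij} X^i Y^j = 0. \tag{8.60}$$

But (cf. (3.95))

$$X^i = X(x^i) = \gamma_*\!\left(\frac{d}{dt}\right)(x^i) = \frac{d}{dt}(x^i \circ \gamma) = \frac{d\gamma^i}{dt}, \tag{8.61}$$

so (8.60) can be written as

$$\frac{d}{dt} Y^k + \Gamma^k{}_{ij}\,\frac{d\gamma^i}{dt}\,Y^j = 0 \tag{8.62}$$

or, equivalently,

$$\frac{dY^k}{dt} + \Gamma^k{}_{ij}\,\frac{dx^i}{dt}\,Y^j = 0. \tag{8.63}$$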


18 When the metric is indefinite we need to consider the positive definite and negative definite subspaces 
separately. 

19 For an extensive discussion of the geometry of parallel transport and its relation to covariant derivatives, see 
Chapter 10 of [60]. 
Figure 8.1 Parallel transport in Euclidean space.


An easy special case occurs when the manifold is ordinary Euclidean space. In 
Cartesian coordinates the Christoffel symbols vanish, and the covariant derivative 
reduces to the ordinary partial derivative. In that case parallel transport just means 
that the Cartesian components of Y do not change along the curve, which is the 
same thing as saying that Y remains parallel along the curve. (See Figure 8.1.) 

EXERCISE 8.18 Show that parallel translation depends only on the curve $\gamma$ and not
on its parameterization.

EXERCISE 8.19 Let $Y$ and $Z$ be two vector fields parallel translated along the same
curve. Show that their lengths and the angle between them remain constant along the
curve. Hint: The length of $Y$ is $g(Y, Y)^{1/2}$, and the cosine of the angle between $Y$
and $Z$ is $g(Y, Z)/[g(Y, Y)\,g(Z, Z)]^{1/2}$.


Note that (8.62) is linear in the components of $Y$, so in principle it admits a
solution for all $t$. This means that, given a curve $\gamma(t)$ and a fixed tangent vector $Y_0$
at $\gamma(0)$, solving (8.62) yields a vector field $Y$ (necessarily unique) such that $Y_{\gamma(t)}$ is
the parallel transported continuation of $Y_0$ along the length of the curve. The map
$\vartheta_t : T_{\gamma(0)}M \to T_{\gamma(t)}M$ given by $Y_0 \mapsto Y_{\gamma(t)}$ is called the parallel transport (or
parallel translation) map. It depends on both the curve $\gamma$ and the connection $\nabla$.20
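The linear system (8.62) is easy to integrate numerically. The following is a minimal sketch, not taken from the text: it assumes the unit 2-sphere with coordinates $(\theta, \phi)$ and its standard Christoffel symbols $\Gamma^\theta{}_{\phi\phi} = -\sin\theta\cos\theta$ and $\Gamma^\phi{}_{\theta\phi} = \Gamma^\phi{}_{\phi\theta} = \cot\theta$, and it transports a vector once around the latitude circle $\theta = \theta_0$. The helper names `christoffel_sphere` and `transport`, and the choice of a fixed-step Runge-Kutta integrator, are illustrative assumptions.

```python
import math

def christoffel_sphere(x):
    """Christoffel symbols G[k][i][j] of the unit 2-sphere in (theta, phi)."""
    theta = x[0]
    G = [[[0.0, 0.0], [0.0, 0.0]], [[0.0, 0.0], [0.0, 0.0]]]
    G[0][1][1] = -math.sin(theta) * math.cos(theta)              # Gamma^theta_{phi phi}
    G[1][0][1] = G[1][1][0] = math.cos(theta) / math.sin(theta)  # Gamma^phi_{theta phi}
    return G

def transport(christoffel, curve, velocity, y0, t0, t1, steps=4000):
    """Integrate dY^k/dt = -Gamma^k_{ij} (dgamma^i/dt) Y^j with classical RK4."""
    n = len(y0)

    def rhs(t, y):
        G, v = christoffel(curve(t)), velocity(t)
        return [-sum(G[k][i][j] * v[i] * y[j] for i in range(n) for j in range(n))
                for k in range(n)]

    h = (t1 - t0) / steps
    y, t = list(y0), t0
    for _ in range(steps):
        k1 = rhs(t, y)
        k2 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(y, k1)])
        k3 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(y, k2)])
        k4 = rhs(t + h, [a + h * b for a, b in zip(y, k3)])
        y = [a + h / 6 * (p + 2 * q + 2 * r + s)
             for a, p, q, r, s in zip(y, k1, k2, k3, k4)]
        t += h
    return y

# Transport the coordinate vector d/dtheta once around the latitude theta0.
theta0 = math.pi / 3
curve = lambda t: (theta0, t)        # gamma(t) = (theta0, t)
velocity = lambda t: (0.0, 1.0)      # dgamma/dt
Y = transport(christoffel_sphere, curve, velocity, [1.0, 0.0], 0.0, 2 * math.pi)

# Squared length with respect to g = dtheta^2 + sin^2(theta) dphi^2.
length_sq = Y[0] ** 2 + math.sin(theta0) ** 2 * Y[1] ** 2
```

For this curve the system has constant coefficients and can be solved by hand: $Y^\theta(t) = \cos(t\cos\theta_0)$ and $Y^\phi(t) = -\sin(t\cos\theta_0)/\sin\theta_0$, so the numerical result can be compared with the closed form. The squared length stays equal to 1, as Exercise 8.19 requires.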

The curve $\gamma$ is a geodesic (or autoparallel) if $\nabla_X X = 0$. Alternatively, a
geodesic is a curve whose tangent vector is parallel transported along itself. On
a Riemannian manifold, geodesics are (at least locally) the shortest-distance paths between
any two points.21 Substituting $X$ for $Y$ in (8.62) gives the geodesic equation,

$$\frac{d^2 \gamma^k}{dt^2} + \Gamma^k{}_{ij}\,\frac{d\gamma^i}{dt}\,\frac{d\gamma^j}{dt} = 0. \tag{8.64}$$

The geodesic equation is also written as

$$\frac{d^2 x^k}{dt^2} + \Gamma^k{}_{ij}\,\frac{dx^i}{dt}\,\frac{dx^j}{dt} = 0, \tag{8.65}$$


20 We investigate the parallel transport map in more detail in Section 8.8. For an instance of the parallel transport 
map on the sphere, see Exercise 8.41. 

21 The proof of this is not too difficult, but we omit it because it requires variational calculus. 
where as usual, $x^i(t) = (\gamma^* x^i)(t) = \gamma^i(t)$. Observe that (8.65) is nonlinear. This
means that, in general, the geodesic equation is much more difficult to solve than 
the parallel transport equation, and questions about how far a geodesic can be 
extended are often quite subtle. 
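To see the nonlinear system (8.65) in action, here is a small numerical sketch. It is an illustration under stated assumptions, not part of the text: the manifold is the unit 2-sphere with coordinates $(\theta, \phi)$, for which (8.65) becomes $\ddot\theta = \sin\theta\cos\theta\,\dot\phi^2$ and $\ddot\phi = -2\cot\theta\,\dot\theta\dot\phi$, and the function name `geodesic_sphere` and the fixed-step Runge-Kutta scheme are hypothetical choices.

```python
import math

def geodesic_sphere(theta, phi, dtheta, dphi, t_end, steps=4000):
    """RK4 integration of the geodesic equation on the unit 2-sphere."""
    def rhs(s):
        th, ph, dth, dph = s
        return [dth,
                dph,
                math.sin(th) * math.cos(th) * dph ** 2,          # -Gamma^theta_{phi phi} (phi')^2
                -2.0 * math.cos(th) / math.sin(th) * dth * dph]  # -2 Gamma^phi_{theta phi} theta' phi'

    h = t_end / steps
    s = [theta, phi, dtheta, dphi]
    for _ in range(steps):
        k1 = rhs(s)
        k2 = rhs([a + h / 2 * b for a, b in zip(s, k1)])
        k3 = rhs([a + h / 2 * b for a, b in zip(s, k2)])
        k4 = rhs([a + h * b for a, b in zip(s, k3)])
        s = [a + h / 6 * (p + 2 * q + 2 * r + w)
             for a, p, q, r, w in zip(s, k1, k2, k3, k4)]
    return s

# Start on the equator at phi = 0 with unit-speed initial velocity (a, b).
a, b = 0.6, 0.8
th, ph, dth, dph = geodesic_sphere(math.pi / 2, 0.0, a, b, 2.0)

# g(X, X) along the curve; it should stay equal to a^2 + b^2 = 1.
speed_sq = dth ** 2 + math.sin(th) ** 2 * dph ** 2

# The great circle through (1, 0, 0) with initial velocity (0, b, -a) in R^3
# lies in the plane with normal (0, a, b).
point = (math.sin(th) * math.cos(ph), math.sin(th) * math.sin(ph), math.cos(th))
normal = (0.0, a, b)
off_plane = sum(p * n for p, n in zip(point, normal))
```

Two checks are built in. The quantity `speed_sq` should remain constant along the solution, and `off_plane` should vanish, because the geodesics of the round sphere are great circles, that is, intersections of the sphere with planes through the origin.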

EXERCISE 8.20 Show that a geodesic does depend on its parameterization, by
showing that the only transformations of parameters leaving (8.64) (or (8.65)) form
invariant are affine parameter changes: $t \mapsto s = at + b$. In other words, two
curves could have the same images but different parameterizations and one could
be a geodesic while the other is not. For this reason, the ‘t’ in (8.64) (or (8.65)) is
called an affine parameter.

EXERCISE 8.21 Show that, for any geodesic,

$$\frac{d}{dt}\left( g_{ij}\,\frac{dx^i}{dt}\,\frac{dx^j}{dt} \right) = 0 \tag{8.66}$$

or, in other words, $(ds/dt)^2 = C$ for some constant $C$ which, by rescaling, we
may choose to satisfy $|C| = 1$. On a Lorentzian manifold we have three choices: (i)
$C = -1$, in which case $t = \tau$ is the proper time along the curve; (ii) $C = 0$, in which
case the curve is a null geodesic (and $t$ is any affine parameter); and (iii) $C = 1$, in
which case $t = s$ is the proper distance along the curve or, equivalently, the curve
is arc length parameterized. (The latter is the only possibility if the manifold is
Riemannian.) Hint: This is essentially a reformulation of an earlier exercise.

In Euclidean space in Cartesian coordinates the Christoffel symbols vanish, and
(8.65) reduces to

$$\frac{d^2 x^k}{dt^2} = 0, \tag{8.67}$$

whose solution is a straight line. Even on nonflat manifolds it turns out that we can
always choose coordinates for which the Christoffel symbols vanish at a point,22 so
locally a geodesic is just a straight line.23 Globally a geodesic may be curved, but
locally it is straight. Therefore geodesics are the closest analogues of straight lines
on a manifold. Also, we can think of $\nabla_X X$ as the “acceleration vector” of the curve,
because that’s what it reduces to in Euclidean space. In general relativity, a particle
that experiences no net force has no acceleration and so follows a geodesic.24

22 See Appendix D. 

23 Technically, this is not quite correct. Even if the Christoffel symbols vanish at some point they may not
vanish at a nearby point, so integrating (8.65) would be problematic. Fortunately, it can be shown that there 
exist coordinates, called Fermi normal coordinates, that have the property that the Christoffel symbols vanish 
in the neighborhood of any point along a geodesic. For a discussion see [60], §13.6. 

24 Gravity is not a force per se in general relativity. Instead, spacetime curvature is the manifestation of grav- 
ity. Particles moving under the gravitational influence of other bodies and subject to no other forces follow 
geodesics in a curved Lorentzian manifold. 
Thinking physically for a moment, imagine two proximate bodies dropped near the
Earth’s surface. They both fall straight down, so their trajectories are parallel. But 
if we place the same two bodies far above the Earth’s surface and let them go, each 
one is pulled towards the center of the Earth along trajectories that would meet at 
the Earth’s center. In Einstein’s theory of general relativity, this occurs because the 
Earth’s mass causes spacetime to curve around the planet and the two freely falling 
bodies move along geodesics in this curved spacetime. Thus, the effect of curvature 
is to cause geodesic deviation, which measures how nearby geodesics converge or 
diverge. 

Consider a family $\gamma(t, s)$ of geodesics. Here $t$ is the parameter distance along
a single curve, while $s$ labels the different geodesics of the family. We therefore
have a family of tangent vector fields $X_s$ associated to each geodesic, as well as
tangent vector fields $J_t$ connecting corresponding points on nearby geodesics, called
Jacobi fields.

The key feature of the Jacobi fields $J_t$ is that they commute with the fields $X_s$,
so that

$$[X_s, J_t] = 0. \tag{8.68}$$

This follows because we may think of $X_s$ as $\partial/\partial t$ and $J_t$ as $\partial/\partial s$, and mixed
partial derivatives commute.25 Using (7.36) (and dropping the subscripts $s$ and $t$
for typographical ease) we can write

$$\nabla_X \nabla_J X - \nabla_J \nabla_X X = R(X, J)X. \tag{8.69}$$

However, $\nabla$ is torsion free so, by (8.68),

$$\nabla_X J - \nabla_J X = [X, J] = 0. \tag{8.70}$$

Combining (8.69) and (8.70) gives

$$\nabla_X \nabla_X J - \nabla_J \nabla_X X = R(X, J)X; \tag{8.71}$$

but the curves are all geodesics, so $\nabla_X X = 0$ and (8.71) reduces to

$$\nabla_X \nabla_X J = R(X, J)X, \tag{8.72}$$


25 Alternatively, the $X_s$ vectors for nearby geodesics are “Lie dragged” along the integral curves of $J_t$ (and vice
versa), so $\mathcal{L}_{X_s} J_t = 0$. Equivalently (cf. Appendix F), $\gamma(t, s)$ is a two-dimensional integral submanifold of $M$.
which is known as the geodesic deviation equation or Jacobi’s equation. It measures
how $J$ changes as one moves along the geodesics. For example, in flat space
the curvature tensor vanishes and (8.72) reduces to

$$\frac{d^2 J^i}{dt^2} = 0,$$

whose solution is linear: $J^i = a^i t + b^i$. In flat space, initially parallel lines remain
parallel (Euclid’s parallel postulate). But, in the presence of nontrivial curvature,
Euclid’s postulate fails; the resulting geometries are therefore called non-Euclidean.
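A curved example makes the contrast with the flat case concrete. On the unit 2-sphere, assuming the standard constant-curvature form $R(X, J)X = -J$ for a unit tangent $X$ and a Jacobi field $J$ orthogonal to it, equation (8.72) becomes $d^2 J/dt^2 = -J$. For two meridians leaving the north pole at a small angle $\varepsilon$ the solution is $J(t) = \varepsilon \sin t$: the geodesics spread apart, reach maximum separation at the equator, and reconverge at the south pole. The sketch below, with the hypothetical helper `meridian_separation`, compares this prediction with the actual great-circle distance between corresponding points.

```python
import math

def meridian_separation(t, eps):
    """Great-circle distance between the points at arc length t from the north
    pole on the meridians phi = 0 and phi = eps of the unit sphere."""
    # Dot product of (sin t, 0, cos t) and (sin t cos eps, sin t sin eps, cos t).
    cos_d = math.cos(t) ** 2 + math.sin(t) ** 2 * math.cos(eps)
    return math.acos(max(-1.0, min(1.0, cos_d)))

eps = 1e-3
samples = [0.5, 1.0, math.pi / 2, 2.5]

# Ratio of the true separation to the Jacobi-field prediction eps * sin(t).
ratios = [meridian_separation(t, eps) / (eps * math.sin(t)) for t in samples]
```

Each ratio differs from 1 only at second order in $\varepsilon$, which is what one expects from a linearized statement about nearby geodesics.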


8.8 Holonomy 

Geodesic deviation is a local phenomenon having to do with how nearby geodesics
behave. There is a more global way to detect the presence of curvature that is
sometimes quite useful; it employs parallel transport around closed curves.26 We have
seen that, given a connection $\nabla$ and a curve $\gamma$, we get a map $\vartheta_t$ between tangent
spaces that maps vectors to vectors by parallel transport. What would happen if we
were to carry a vector around a closed curve? The answer is that it may not return
to its original state. The amount by which it differs from its original state is related
to the curvature of the manifold.

To illustrate, let’s see what happens on the unit 2-sphere $S^2$. By Exercise 8.42,
the geodesics of $S^2$ are just great circles. Consider a geodesic triangle on the sphere
consisting of the three great circle arcs $\gamma_1$, $\gamma_2$, and $\gamma_3$, where $\gamma_1$ starts at the north
pole and goes down a meridian (longitude line) to the equator, $\gamma_2$ goes along the
equator to the meridian at $\psi$, and $\gamma_3$ goes along the meridian at $\psi$ back to the north
pole. Let $A$ be the tangent vector to $\gamma_1$ pointing away from the north pole. (See
Figure 8.2.)

By definition, the tangent vector of a geodesic is parallel transported along itself.
Let $X_i$ be the tangent vector to $\gamma_i$. Then at the north pole $A = X_1$. It points due
south on the zeroth longitude. After parallel transport along $\gamma_1$, $A$ is still $X_1$, so
it still points due south. Now we transport $A$ along $\gamma_2$. By Exercise 8.19 parallel
transport preserves the angle between vectors, so $A$ maintains its angle with $X_2$ at
all times. At the start of its journey it is pointing due south at 90 degrees to $X_2$, so at
the end of its journey it is still pointing due south. Finally, during its journey back
along $\gamma_3$ it remains pointing due south, so when it returns to the north pole it is
pointing at an angle of $\psi$ relative to its original direction. If we had tried this little
exercise in flat space the vector $A$ would have returned to its original value, so the


26 In this section we assume that $M$ is connected.
Figure 8.2 Parallel transport of a vector about a geodesic triangle on the sphere.


angular deviation of a vector after parallel transport on a manifold is an indication
that we are no longer in flat space.27
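The triangle argument can be checked numerically. The sketch below is an illustration under an assumption not spelled out in the text: it views the unit sphere as embedded in $\mathbb{R}^3$ and uses the fact that the Levi-Civita connection of the round metric is the tangential projection of the ordinary derivative, so that a tangent vector $V$ is parallel along $\gamma$ exactly when $dV/dt = -(V \cdot \dot\gamma)\,\gamma$. This avoids the coordinate singularity of $(\theta, \phi)$ at the north pole. The function names are hypothetical.

```python
import math

def transport_on_sphere(gamma, dgamma, v0, t0, t1, steps=2000):
    """RK4 integration of V' = -(V . gamma') gamma along a curve on the unit sphere."""
    def rhs(t, v):
        g, dg = gamma(t), dgamma(t)
        c = -sum(a * b for a, b in zip(v, dg))
        return [c * x for x in g]

    h = (t1 - t0) / steps
    v, t = list(v0), t0
    for _ in range(steps):
        k1 = rhs(t, v)
        k2 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(v, k1)])
        k3 = rhs(t + h / 2, [a + h / 2 * b for a, b in zip(v, k2)])
        k4 = rhs(t + h, [a + h * b for a, b in zip(v, k3)])
        v = [a + h / 6 * (p + 2 * q + 2 * r + s)
             for a, p, q, r, s in zip(v, k1, k2, k3, k4)]
        t += h
    return v

def triangle_holonomy_angle(psi):
    """Angle between a vector at the north pole and its parallel transport around
    the geodesic triangle whose equatorial side spans the longitudes 0 to psi."""
    half_pi = math.pi / 2
    legs = [
        # gamma_1: north pole down the meridian phi = 0 to the equator
        (lambda t: (math.sin(t), 0.0, math.cos(t)),
         lambda t: (math.cos(t), 0.0, -math.sin(t)), half_pi),
        # gamma_2: along the equator from phi = 0 to phi = psi
        (lambda t: (math.cos(t), math.sin(t), 0.0),
         lambda t: (-math.sin(t), math.cos(t), 0.0), psi),
        # gamma_3: up the meridian phi = psi back to the north pole
        (lambda t: (math.cos(t) * math.cos(psi), math.cos(t) * math.sin(psi), math.sin(t)),
         lambda t: (-math.sin(t) * math.cos(psi), -math.sin(t) * math.sin(psi), math.cos(t)),
         half_pi),
    ]
    v0 = [1.0, 0.0, 0.0]          # tangent to gamma_1 at the north pole
    v = v0
    for gamma, dgamma, t_end in legs:
        v = transport_on_sphere(gamma, dgamma, v, 0.0, t_end)
    dot = sum(a * b for a, b in zip(v, v0))
    norm = math.sqrt(sum(a * a for a in v))
    return math.acos(max(-1.0, min(1.0, dot / norm)))
```

Calling `triangle_holonomy_angle(psi)` for $0 < \psi < \pi$ should reproduce $\psi$ up to integration error, in agreement with the geometric argument above. Note that $\psi$ is also the area of the triangle on the unit sphere, a first hint of the link between holonomy and curvature.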

To discover the relationship between holonomy and curvature, we require an
explicit form for the parallel transport operator. The differential equation (8.62)
defining parallel transport can be written as

$$\frac{dY}{dt} + AY = 0, \tag{8.73}$$

where $Y$ is a vector and $A$ is a matrix with components $A^k{}_j = \Gamma^k{}_{ij}\,(d\gamma^i/dt)$. The
thing that makes this equation tricky is that $A$ depends on $t$ (through $\gamma$). If it did
not, the solution of (8.73) would just be

$$Y(t) = e^{-tA}\,Y(0), \tag{8.74}$$

where for simplicity we write $Y(t)$ instead of $Y(\gamma(t)) = Y_{\gamma(t)}$; the exponential of
a matrix is defined by its power series expansion:

$$e^{-tA} = 1 - tA + \frac{(tA)^2}{2!} - \cdots. \tag{8.75}$$

But in fact $A$ generally does depend on $t$, so we need to be more clever.

The way forward is to try an iterative solution. Write (8.73) as

$$\frac{dY}{dt} = -AY,$$


27 Nature abounds with examples of holonomy. For example, one can view the rotation of the plane of oscillation
of a Foucault pendulum as being due to parallel translation around a latitude (see [36] or [71]). The relevant 
computation is performed in Exercise 8.41. Many other examples can be found in the compendium [76]. 
and integrate both sides to get

$$Y(t) = Y(0) - \int_0^t dt_1\, A(t_1)\, Y(t_1),$$

where $A(t) := A(\gamma(t))$. Now substitute the left-hand side into the right-hand side
to get

$$\begin{aligned}
Y(t) &= Y(0) - \int_0^t dt_1\, A(t_1) \left[ Y(0) - \int_0^{t_1} dt_2\, A(t_2)\, Y(t_2) \right] \\
&= Y(0) - \int_0^t dt_1\, A(t_1)\, Y(0) + \int_0^t dt_1 \int_0^{t_1} dt_2\, A(t_1) A(t_2)\, Y(t_2).
\end{aligned}$$

Continuing in this manner we get

$$Y(t) = \left[ \sum_{n=0}^{\infty} (-1)^n \int_0^t dt_1 \int_0^{t_1} dt_2 \cdots \int_0^{t_{n-1}} dt_n\, A(t_1) \cdots A(t_n) \right] Y(0). \tag{8.76}$$

Provided $A$ does not blow up anywhere along $\gamma$, this infinite sum converges to a
solution of (8.73).

Equation (8.76) is often rewritten using a suggestive kind of shorthand notation.
You will notice that the matrices are ordered in such a way that “later is on the
left”, which means $t_1 > t_2 > \cdots > t_n$ (each inner integration variable is bounded above
by the previous one); this motivates the introduction of a path
ordering operator $\mathcal{P}$, which reorders any matrix product in such a way that “later
is on the left”. If $i_1 < \cdots < i_n$, and if $\sigma \in S_n$ is any permutation,

$$\mathcal{P}\left( A(t_{i_{\sigma(1)}}) \cdots A(t_{i_{\sigma(n)}}) \right) = A(t_{i_1}) \cdots A(t_{i_n}). \tag{8.77}$$


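With this notation the multiple integral in (8.76) can be written

$$\frac{1}{n!} \int_0^t dt_1 \int_0^t dt_2 \cdots \int_0^t dt_n\, \mathcal{P}\left( A(t_1) \cdots A(t_n) \right), \tag{8.78}$$

or, more simply (and with a shocking abuse of notation),

$$\frac{1}{n!}\, \mathcal{P} \left( \int_0^t A(t)\, dt \right)^n. \tag{8.79}$$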


EXERCISE 8.22 Prove that (8.78) does indeed equal the multiple integral in (8.76). 


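Finally, then, we define the path ordered exponential by

$$\mathcal{P} e^{-\int_0^t A(t)\,dt} := \sum_{n=0}^{\infty} \frac{(-1)^n}{n!}\, \mathcal{P} \left( \int_0^t A(t)\, dt \right)^n, \tag{8.80}$$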
whereupon (8.76) can be written, by analogy with (8.74) (to which it reduces when
$A$ is constant),

$$Y(t) = \mathcal{P} e^{-\int_0^t A(t)\,dt}\, Y(0). \tag{8.81}$$

Thus, we see that the path ordered exponential operator is precisely the parallel
transport map $\vartheta_t$ (see the text before (8.64)):

$$\vartheta_t = \mathcal{P} e^{-\int_0^t A(t)\,dt}. \tag{8.82}$$

More generally, we write

$$\vartheta_{st}(\gamma, \nabla) := \mathcal{P} e^{-\int_s^t A(t)\,dt} \tag{8.83}$$

for the operator parallel transporting vectors along $\gamma$ from $T_{\gamma(s)}M$ to $T_{\gamma(t)}M$
relative to the connection $\nabla$. If $\gamma = \gamma_1 \cdots \gamma_m$ is a piecewise smooth path then

$$\vartheta_{0,1}(\gamma, \nabla) = \vartheta_{0,1}(\gamma_m, \nabla) \cdots \vartheta_{0,1}(\gamma_1, \nabla). \tag{8.84}$$

By Exercise 8.18, this is independent of the parameterizations of the individual
pieces and therefore depends only on the curve $\gamma$ (and the connection $\nabla$).
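One practical way to think about the path ordered exponential is as a limit of ordered products of short-time exponentials, with later factors multiplied on the left, exactly as in the composition rule (8.84). The sketch below is only an illustration of that idea, not the construction used in the text: the function names `matmul`, `expm`, and `path_ordered_exp`, the midpoint sampling of $A$, and the two test matrices are all assumptions made for the example.

```python
import math

def matmul(a, b):
    """Product of two square matrices stored as nested lists."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def expm(a, terms=20):
    """Matrix exponential by its power series (adequate for small matrices)."""
    n = len(a)
    result = [[float(i == j) for j in range(n)] for i in range(n)]
    term = [row[:] for row in result]
    for m in range(1, terms):
        term = [[x / m for x in row] for row in matmul(term, a)]
        result = [[r + t for r, t in zip(rr, tr)] for rr, tr in zip(result, term)]
    return result

def path_ordered_exp(A, t0, t1, steps=2000):
    """Approximate P exp(-int_{t0}^{t1} A(t) dt) by an ordered product of
    short-time exponentials, with later times multiplied on the left."""
    n = len(A(t0))
    h = (t1 - t0) / steps
    U = [[float(i == j) for j in range(n)] for i in range(n)]
    for s in range(steps):
        mid = t0 + (s + 0.5) * h
        step = expm([[-h * x for x in row] for row in A(mid)])
        U = matmul(step, U)          # later factor goes on the left
    return U

# Case 1: A(t) = t J commutes with itself at different times, so path ordering
# is irrelevant and the result is the ordinary exponential exp(-(T^2 / 2) J).
J = [[0.0, -1.0], [1.0, 0.0]]
U_rot = path_ordered_exp(lambda t: [[t * x for x in row] for row in J], 0.0, 2.0)

# Case 2: A(t) = A1 for t < 1 and A2 afterwards, with A1 A2 != A2 A1.
A1 = [[0.0, 1.0], [0.0, 0.0]]
A2 = [[0.0, 0.0], [1.0, 0.0]]
U_pc = path_ordered_exp(lambda t: A1 if t < 1.0 else A2, 0.0, 2.0)
```

In the first case the answer is the matrix with rows $(\cos 2, \sin 2)$ and $(-\sin 2, \cos 2)$. In the second case the ordered product is $e^{-A_2} e^{-A_1}$, with rows $(1, -1)$ and $(-1, 2)$, whereas the opposite ordering would give rows $(2, -1)$ and $(-1, 1)$; the difference is precisely what the operator $\mathcal{P}$ keeps track of.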

EXERCISE 8.23 The inverse of a curve $\gamma$ is the curve $\gamma^{-1}$ given by $\gamma^{-1}(t) =
\gamma(1 - t)$. The trivial curve (at $p$) is the map $e_p : [0, 1] \to M$ with $e_p(t) = p$. Show
that $\vartheta(e_p, \nabla) = \mathrm{id}$ and that $\vartheta(\gamma^{-1}, \nabla) = [\vartheta(\gamma, \nabla)]^{-1}$. Hint: For the second claim
recall that $A(t) = \Gamma^k{}_{ij}\,(d\gamma^i/dt)$. Note that $\gamma\gamma^{-1} \neq e_p$.

Now consider the set $C_p$ of all loops based at $p$.29 Although these loops do not
form a group (because, for example, for nontrivial $\gamma \in C_p$ we have $\gamma\gamma^{-1} \neq e_p$),
Exercise 8.23 shows that their holonomies do, because the product of two loops is
another loop. The holonomy group of $\nabla$ based at $p$ is

$$H(\nabla; p) := \{ \vartheta(\gamma, \nabla) : \gamma \in C_p \}. \tag{8.85}$$

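EXERCISE 8.24 Show that if $\gamma$ is a curve with $\gamma(0) = p$ and $\gamma(1) = q$ then

$$H(\nabla; q) = \vartheta(\gamma, \nabla)\, H(\nabla; p)\, \vartheta(\gamma^{-1}, \nabla), \tag{8.86}$$

i.e., every $H' \in H(\nabla; q)$ is of the form $\vartheta(\gamma, \nabla)\, H\, \vartheta(\gamma^{-1}, \nabla)$ for $H \in H(\nabla; p)$. One says that
holonomy groups at different basepoints are conjugate.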

The restricted holonomy group $H_0(\nabla; p)$ of $\nabla$ based at $p$ is the holonomy group
of all the contractible loops at $p$. It is clearly a subgroup of the full holonomy
group. It is also geometrically significant.


28 For the definition of a product of paths, see Exercise 4.11.

29 A loop based at $p$ is just a path that starts and ends at $p$. See Exercise 4.11.
Theorem 8.9 $H_0(\nabla; p)$ is trivial (i.e., it consists only of the identity element) if
and only if the curvature vanishes everywhere.

Proof In Appendix E we show that the holonomy around an infinitesimal
coordinate loop of parameter area $\alpha^2$ centered at a point $p$ can be written

$$\vartheta = 1 + \alpha^2 \mathcal{R} + O(\alpha^3), \tag{8.87}$$

where $\mathcal{R}$ is basically the Riemann curvature tensor at $p$ and $O(\alpha^3)$ represents terms
of order $\alpha^3$ and higher. It then follows that if the holonomy group is trivial then the
curvature tensor must vanish at every point.

The converse follows from a famous result of Ambrose and Singer [3] (see
also [47]), which is unfortunately beyond the scope of this discussion. Using only
(8.87) we can give a “physicist-style” proof.30 Consider a coordinate square of side
length $L$. Divide it up into $N^2$ smaller squares of side length $L/N$. By traversing
the perimeter of each square in the same direction, one ends up walking around
the individual loops in such a way that the internal edges are traversed in opposite
directions. Thus, the holonomy of the larger loop is just the product of the
holonomies of the smaller loops (because the holonomies along the internal edges
cancel pairwise). If the curvature tensor vanishes then from (8.87) the holonomy
of the large square is approximately

$$\vartheta \approx \left( 1 + O((L/N)^3) \right)^{N^2} \approx 1 + O(1/N),$$

whose deviation from the identity vanishes in the limit that $N \to \infty$. (This trick only
works if the large loop is contractible, which is why that condition is required.)31 □

Remark The theorem is not true if we replace the restricted holonomy group by 
the full holonomy group because, even if the curvature tensor vanishes, a noncon- 
tractible loop can still have a nontrivial holonomy if the manifold fails to be simply 
connected. An example is provided by the Aharonov-Bohm effect. (Look it up!)

We can now take care of some unfinished business. 

Second proof of Theorem 8.7 Pick a point $p \in M$ and a vector $X_p \in T_pM$. If
the Riemann curvature tensor vanishes then, by Theorem 8.9, parallel transport is
path independent.32 This means that we obtain a unique vector field $X$ by parallel
transporting $X_p$ from one place to another in $M$. Moreover, since $X$ is parallel
transported along every curve, $\nabla_Y X = 0$ for any vector field $Y$. In this way we may

30 The following argument is adapted from [86].

31 Of course, the converse would also follow from Theorem 8.7 but, as we wish to use Theorem 8.9 to prove 
Theorem 8.7, we cannot very well invoke Theorem 8.7 here. 

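32 This is just the usual argument. Let $\gamma_1$ and $\gamma_2$ be two paths from $p$ to $q$, say. Then, as parallel transport around
a closed loop is trivial, $1 = \vartheta_{pq}(\gamma_1)\,\vartheta_{qp}(\gamma_2^{-1}) = \vartheta_{pq}(\gamma_1)\,\vartheta_{pq}(\gamma_2)^{-1}$, so $\vartheta_{pq}(\gamma_1) = \vartheta_{pq}(\gamma_2)$.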
extend any vector at $p$ to a covariantly constant vector field on $M$. In particular,
given an orthonormal frame $X_1, \dots, X_n$ at $p$ we can extend it in the same fashion to
a global frame field on $M$. As parallel transport preserves lengths and angles, this
frame field is everywhere orthonormal so, with respect to this frame, $g_{ij} = \delta_{ij}$. As
the connection is torsion free we have

$$[X_i, X_j] = \nabla_{X_i} X_j - \nabla_{X_j} X_i = 0, \tag{8.88}$$

so by Theorem 8.2 there exist local coordinates $x^i$ such that $X_i = \partial/\partial x^i$. These
are the coordinates we seek. □

This proof reveals the relationship between holonomic frame fields and holon- 
omy: if the holonomy group is trivial then every frame field is holonomic (on some 
neighborhood), whereas this is not true if there is nontrivial holonomy. 


8.9 Hodge theory 

In this section we assume that $M$ is an $n$-dimensional compact, orientable,
Riemannian manifold without boundary. In particular, we assume that the metric is
positive definite. It is a deep result of Hodge that, on such a manifold, every $k$-form
admits a canonical decomposition with very nice properties and this decomposition
provides information about the cohomology of $M$. We briefly examine some
of these ideas here.

Let $g$ be the metric on $M$. In terms of local coordinates $x^1, \dots, x^n$ on a
neighborhood $U$ of a point $p$, we define an $n$-form

$$\sigma = \sqrt{G}\; dx^1 \wedge \cdots \wedge dx^n, \qquad G := \det g_{ij}. \tag{8.89}$$

As $M$ is orientable, this form extends to a global nowhere-vanishing $n$-form on $M$.
It is called the (canonical) volume form of $M$.33

EXERCISE 8.25 Verify this last assertion, namely that $\sigma$ extends to a global
nowhere-vanishing $n$-form. Hint: You may want to complete Exercise 8.52(c) first.

EXERCISE 8.26 Let $\theta^1, \dots, \theta^n$ be an orthonormal basis of 1-forms on a
neighborhood $U$ of $p$. Show that on $U$ we may write

$$\sigma = \theta^1 \wedge \cdots \wedge \theta^n. \tag{8.90}$$

The manifold $M$ is equipped with a metric, so we have an inner product in each
tangent space $T_pM$ as well as in each cotangent space $T_p^*M$. Suppose that we have

33 Equation (8.89) also defines the canonical volume form on a pseudo-Riemannian manifold, provided one
replaces $\sqrt{G}$ by $\sqrt{|G|}$.
a $k$-form $\omega$ on $M$. Then, as in Section 2.9, we can define its Hodge dual $\star\omega$ to be
the unique $(n - k)$-form satisfying

$$\omega \wedge \mu = g(\star\omega, \mu)\,\sigma \tag{8.91}$$

for any $(n - k)$-form $\mu$, where $\sigma$ is the volume form defined above and $g$ is the
induced metric (2.55) applied to $(n - k)$-forms. All the properties of the Hodge star
operator then carry through as before.

The set $\Omega^k(M)$ of all differential $k$-forms on $M$ is a vector space under the
operations of pointwise addition and scalar multiplication. Hodge’s fruitful idea
was to turn this space into an inner product space using his star operator. Thus,
given $\eta, \lambda \in \Omega^k(M)$, we define their inner product by

$$(\eta, \lambda) = \int_M \eta \wedge \star\lambda. \tag{8.92}$$

By (2.61) (and the fact that $(-1)^d = 1$ on a Riemannian manifold)

$$(\eta, \lambda) = \int_M g(\eta, \lambda)\,\sigma. \tag{8.93}$$

It follows that Hodge’s inner product is positive definite.

Now define a new operator $\delta : \Omega^k(M) \to \Omega^{k-1}(M)$ by the rule

$$\delta\eta = (-1)^{nk+n+1} \star d \star \eta. \tag{8.94}$$

The operator $\delta$ is sometimes called the co-differential operator.

EXERCISE 8.27 Show that $\delta^2 = 0$.

The key fact about the operator $\delta$ is that it is precisely the adjoint of the differential
operator $d$ with respect to the Hodge inner product:

Theorem 8.10 If $\eta \in \Omega^k(M)$ and $\lambda \in \Omega^{k+1}(M)$ then

$$(d\eta, \lambda) = (\eta, \delta\lambda). \tag{8.95}$$

Proof The manifold $M$ is compact without boundary so, by Stokes’ theorem and
the properties of the exterior derivative operator,

$$0 = \int_M d(\eta \wedge \star\lambda) = \int_M d\eta \wedge \star\lambda + (-1)^k \int_M \eta \wedge d{\star\lambda}.$$

Note that $\lambda$ is a $(k+1)$-form, so $\star\lambda$ is an $(n-k-1)$-form and $d{\star\lambda}$ is an $(n-k)$-form.
Thus, from the definition (8.94),

$$\delta\lambda = (-1)^{n(k+1)+n+1} \star d \star \lambda = (-1)^{nk+1} \star d \star \lambda.$$
Employing (2.60) gives

$$\star\delta\lambda = (-1)^{nk+1} \star^2 d{\star\lambda} = (-1)^{nk+1} (-1)^{k(n-k)}\, d{\star\lambda} = (-1)^{k^2+1}\, d{\star\lambda}.$$

Combining everything we obtain

$$0 = (d\eta, \lambda) + (-1)^{k(k+1)+1} (\eta, \delta\lambda),$$

from which the conclusion follows. □
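The sign bookkeeping in this proof is easy to get wrong, so it can be reassuring to check it mechanically. The following short sketch (the function name `signs_consistent` is hypothetical) verifies, for small values of $n$ and $k$, the three parity facts used above: the simplification of the exponent in $\delta\lambda$, the effect of $\star^2$ on an $(n-k)$-form, and the fact that the final sign $(-1)^{k(k+1)+1}$ is always $-1$.

```python
def signs_consistent(n_max=8):
    """Check the parity identities used in the proof that delta is adjoint to d."""
    for n in range(1, n_max + 1):
        for k in range(0, n):
            # (-1)^{n(k+1)+n+1} = (-1)^{nk+1}
            if (-1) ** (n * (k + 1) + n + 1) != (-1) ** (n * k + 1):
                return False
            # star^2 on an (n-k)-form contributes (-1)^{k(n-k)}
            star_sq = (-1) ** (k * (n - k))
            if (-1) ** (n * k + 1) * star_sq != (-1) ** (k * k + 1):
                return False
            # (-1)^k (-1)^{k^2+1} = (-1)^{k(k+1)+1} = -1, since k(k+1) is even
            if (-1) ** k * (-1) ** (k * k + 1) != (-1) ** (k * (k + 1) + 1):
                return False
            if (-1) ** (k * (k + 1) + 1) != -1:
                return False
    return True
```

Since the last sign is always $-1$, the final equation of the proof reads $0 = (d\eta, \lambda) - (\eta, \delta\lambda)$, which is (8.95).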

Next we introduce the elegant Laplace-de Rham operator (sometimes also
called the Hodge-de Rham Laplacian),

$$\Delta := d\delta + \delta d, \tag{8.96}$$

which generalizes the usual Laplacian operator.34 We say that a $k$-form $\omega$ is closed
if $d\omega = 0$, co-closed if $\delta\omega = 0$, and harmonic if $\Delta\omega = 0$. Obviously, if $\omega$ is
closed and co-closed then it is harmonic. Interestingly, the converse also holds. If
$\omega$ is harmonic then, by Theorem 8.10,

$$0 = (\Delta\omega, \omega) = (d\delta\omega, \omega) + (\delta d\omega, \omega) = (\delta\omega, \delta\omega) + (d\omega, d\omega).$$

But the inner product is positive definite, so both terms on the right must vanish
separately and this can only happen if $d\omega = \delta\omega = 0$.

Remark It follows that $\Delta$ is an elliptic (positive definite) self-adjoint differential
operator. That is, $(\Delta\lambda, \eta) = (\lambda, \Delta\eta)$ and $(\Delta\omega, \omega) \ge 0$ with equality holding only
if $\Delta\omega = 0$.

EXERCISE 8.28 Assume that $M$ is connected and that $\dim M = n$. Show that any
harmonic function on $M$ is necessarily a constant function. Furthermore, show that
any harmonic $n$-form is a constant multiple of the volume form. Hint for the second
part: Any top-dimensional form is a smooth function times the volume form. Now
use the first part together with the fact (which you should prove) that $\star\Delta = \Delta\star$.

This brings us to Hodge’s powerful result. 

Theorem 8.11 (Hodge decomposition theorem) Let $M$ be a compact, orientable,
Riemannian manifold without boundary. Any $k$-form $\omega$ on $M$ can be decomposed
uniquely into the sum of an exact form, a co-exact form, and a harmonic form:

$$\omega = d\alpha + \delta\beta + \gamma. \tag{8.97}$$


34 See Exercise 8.56.
Proof Uniqueness is easy. By linearity, it suffices to show that

$$d\alpha + \delta\beta + \gamma = 0 \tag{8.98}$$

implies that $d\alpha = \delta\beta = \gamma = 0$. Applying $d$ to (8.98) (and remembering that $\gamma$ is
harmonic and therefore closed) gives

$$d\delta\beta = 0 \;\Rightarrow\; (d\delta\beta, \beta) = 0 \;\Rightarrow\; (\delta\beta, \delta\beta) = 0 \;\Rightarrow\; \delta\beta = 0. \tag{8.99}$$

Applying $\delta$ to (8.98) gives $d\alpha = 0$ by a similar argument, so $\gamma = 0$ as well.
Existence boils down to proving an existence theorem for the solution of an elliptic
second-order partial differential equation, so we merely refer the intrepid reader to
([29], Appendix C, or [88], §6.8). □

If $\omega$ is closed then the same argument as given above shows that we can
decompose $\omega$ uniquely as $\omega = d\alpha + \gamma$. In other words, $\omega$ can be expressed in a canonical
way as the sum of a harmonic form and an exact form. In particular, $\gamma$ is a
cohomology class representative for $[\omega]$. Therefore, we have another important theorem
of Hodge.

Theorem 8.12 (Hodge) Let $M$ be a compact, orientable, Riemannian manifold
without boundary. Every closed form $\omega$ on $M$ has a unique harmonic
cohomology class representative. In other words, the de Rham cohomology $H^k_{\mathrm{dR}}(M)$ is
isomorphic to $\mathcal{H}^k(M)$, the space of harmonic $k$-forms on $M$.

We can use Hodge’s theorem to prove an important result in cohomology. 

Theorem 8.13 (Poincaré duality) Let $M$ be a compact, connected, orientable,
$n$-dimensional manifold without boundary. For every $k$,

$$H^{n-k}_{\mathrm{dR}}(M) \cong H^k_{\mathrm{dR}}(M)^*. \tag{8.100}$$

In particular, $\beta_{n-k} = \beta_k$, where $\beta_k = \dim H^k_{\mathrm{dR}}(M)$ is the $k$th Betti number of $M$.

Proof Let $\eta \in \Omega^k(M)$ and $\lambda \in \Omega^{n-k}(M)$ be closed. There is a natural pairing
between $H^k_{\mathrm{dR}}(M)$ and $H^{n-k}_{\mathrm{dR}}(M)$, given by

$$([\eta], [\lambda]) := \int_M \eta \wedge \lambda. \tag{8.101}$$

This bilinear form is well defined, as can be shown in Exercise 8.29 below. It is
also nondegenerate. To show this, we must show that for any nonzero class $[\eta]$
there exists a nonzero class $[\lambda]$ such that $([\eta], [\lambda]) \neq 0$.
This can be done as follows. Pick a Riemannian metric on $M$.35 Let $[\eta]$ be
a nonzero class. By Theorem 8.12 we may assume that $\eta$ is harmonic. Define
$\lambda := \star\eta$. It is closed because $\eta$ is co-closed and, by construction,

$$([\eta], [\lambda]) = (\eta, \eta) \neq 0 \tag{8.102}$$

so by the Riesz lemma the proof is complete. □

EXERCISE 8.29 Show that the pairing (8.101) is well defined (i.e., independent of 
class representative). 

EXERCISE 8.30 Show that a closed (compact, without boundary) connected ori- 
entable manifold is never contractible. Hint: What does Poincare duality say about 
the top-dimensional cohomology? 


Additional exercises 


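8.31 Covariant derivatives of tensor fields and semicolon notation. In a
coordinate basis, the covariant derivative of a tensor field $T$ of type $(r, s)$ has
components

$$\begin{aligned}
(\nabla_k T)^{i_1 \dots i_r}{}_{j_1 \dots j_s} = {} & T^{i_1 \dots i_r}{}_{j_1 \dots j_s,k} \\
& + \Gamma^{i_1}{}_{kl}\, T^{l i_2 \dots i_r}{}_{j_1 \dots j_s} + \cdots + \Gamma^{i_r}{}_{kl}\, T^{i_1 \dots i_{r-1} l}{}_{j_1 \dots j_s} \\
& - \Gamma^{l}{}_{k j_1}\, T^{i_1 \dots i_r}{}_{l j_2 \dots j_s} - \cdots - \Gamma^{l}{}_{k j_s}\, T^{i_1 \dots i_r}{}_{j_1 \dots j_{s-1} l},
\end{aligned} \tag{8.103}$$

where $\nabla_k := \nabla_{\partial_k}$ and $T^{i_1 \dots i_r}{}_{j_1 \dots j_s,k} := \partial_k T^{i_1 \dots i_r}{}_{j_1 \dots j_s}$. In shorthand notation (which, in
fairness, is used by both mathematicians and physicists), the left-hand side of
this expression is often written

$$\nabla_k T^{i_1 \dots i_r}{}_{j_1 \dots j_s} \quad \text{or} \quad T^{i_1 \dots i_r}{}_{j_1 \dots j_s;k}. \tag{8.104}$$

The latter expression defines the meaning of the semicolon notation.

Confirm (8.103) in a special case by verifying that it holds for a tensor field
of type (1, 1). In other words, show that

$$T^i{}_{j;k} = T^i{}_{j,k} + \Gamma^i{}_{kl}\, T^l{}_j - \Gamma^l{}_{kj}\, T^i{}_l. \tag{8.105}$$

Confirm the general formula if you really like indices.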


35 We can do this by virtue of Theorem 8.1. Apparently lawyers are useful for something.
8.32 Commas to semicolons Show that the expression for the Lie derivative
given in (3.124) remains valid if the ordinary derivatives are replaced every- 
where by covariant derivatives or, equivalently, if the commas in (3.124) are 
replaced by semicolons (as defined in the previous exercise). 

8.33 Covariant constancy of the metric Show that the metric tensor is
covariantly constant, namely that

$$\nabla_X g = 0, \tag{8.106}$$

by showing that, in a coordinate basis, $g_{ij;k} = 0$. Hint: Use (8.103) and metric
compatibility.

8.34 Covariant derivatives commute with contractions In a coordinate basis,
the raising and lowering of indices commutes with semicolons. Confirm this
assertion in a special case by verifying the following equation:

$$g_{im}\, T^i{}_{j;k} = T_{mj;k}. \tag{8.107}$$

Hint: Use the previous exercise.

8.35 The Bianchi identity revisited Show that, in a coordinate basis, the
Bianchi identity (7.20) becomes

$$R^i{}_{j[kl;m]} = 0, \tag{8.108}$$

where, as usual, square brackets denote antisymmetrization. Equivalently,
show that the Bianchi identity can also be written as

$$[\nabla_k, [\nabla_l, \nabla_m]] + [\nabla_l, [\nabla_m, \nabla_k]] + [\nabla_m, [\nabla_k, \nabla_l]] = 0.$$

Hint: You may wish to use equations (8.35), (8.45), and (8.103).

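8.36 Ricci identity For any tensor field $\Psi^{i_1 \dots i_r}{}_{j_1 \dots j_s}$ of type $(r, s)$, Ricci’s
identity is

$$[\nabla_k, \nabla_l]\, \Psi^{i_1 \dots i_r}{}_{j_1 \dots j_s} = \sum_{p=1}^{r} R^{i_p}{}_{mkl}\, \Psi^{i_1 \dots i_{p-1} m i_{p+1} \dots i_r}{}_{j_1 \dots j_s} - \sum_{q=1}^{s} R^{m}{}_{j_q kl}\, \Psi^{i_1 \dots i_r}{}_{j_1 \dots j_{q-1} m j_{q+1} \dots j_s}, \tag{8.109}$$

where $[A, B] := AB - BA$. Prove this formula for the special case $r = 2$,
$s = 0$. Warning: Be careful - partial derivatives and Christoffel symbols are
not tensors so you must be clever when applying (8.103). Alternatively, you
could do the proof abstractly by writing $\Psi = X \otimes Y$ for two vector fields
or 2-forms $X$ and $Y$ and then extend everything to higher-order tensors by
linearity.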
8.37 Killing fields The vector field $X$ is called a Killing field (after the German
mathematician Wilhelm Killing) if $\mathcal{L}_X g = 0$, where $g$ is the metric tensor.

(a) Show that $X$ is a Killing field if and only if

$$g(\nabla_Y X, Z) + g(Y, \nabla_Z X) = 0, \tag{8.110}$$

where $\nabla$ is the Levi-Civita connection. This is called Killing’s equation.
Hint: Use (3.121).

(b) If $X = X^i \partial_i$ in a coordinate basis, show that Killing’s equation can be
written as

$$X_{i;j} + X_{j;i} = 0 \tag{8.111}$$

for all pairs $i, j$, where $X_i := g_{ik} X^k$.

(c) Let $\gamma$ be a geodesic with tangent vector $Y$. Show that $g(X, Y)$ is a
constant along $\gamma$.

(d) Suppose that we choose local coordinates such that $X = \partial/\partial x^1$. (By the
existence and uniqueness theorem for ordinary differential equations we
can always do this. See Lemma F.4.) Show that the components of the
metric tensor do not depend on $x^1$, i.e., show that $\partial g_{jk}/\partial x^1 = 0$.

(e) More generally, show that the flow of a Killing field is an isometry of the
manifold. Hint: Use $\mathcal{L}_X g = 0$ and integrate.

(f) Show that the set of Killing fields forms a Lie algebra under the Lie 
bracket. Remark: The resulting Lie algebra is precisely that of the 
symmetry group of the manifold. 

8.38 Spatial symmetries In Euclidean space the Christoffel symbols vanish so,
in the standard coordinate basis, Killing's equation reduces to

$$ \xi_{i,j} + \xi_{j,i} = 0, $$

where $\xi_{i,j} = \partial \xi_i / \partial x^j$.

(a) Show that in three-dimensional Euclidean space the general solution to
Killing's equation is

$$ \xi_i = \alpha_{ij} x^j + a_i, \tag{8.112} $$

where $\alpha_{ij}$ is a constant antisymmetric tensor and $a_i$ is a constant
vector. Hint: Show that $\xi_{i,jk} = 0$ by alternating the application of Killing's
equation and the fact that mixed partials commute. Remark: Observe
that there are three independent degrees of freedom from $\alpha_{ij}$ and another
three from $a_i$, giving a total of six independent Killing vectors. It turns
out that this is the maximal number of Killing fields on any
three-dimensional manifold (although, of course, there can only be three
linearly independent Killing fields at any given point).

(b) According to Exercise 8.37 the (co-)vector fields (8.112) are the
infinitesimal generators of symmetries. With this interpretation the $a_i$ represent the
three independent translation symmetries of Euclidean space. We wish to
interpret the other Killing fields geometrically. Consider the Killing field
with $a_i = 0$, $\alpha_{12} = -\alpha_{21} = 1$, and all other components of $\alpha_{ij}$ equal to
zero. Show that the integral curves of this Killing field are circles around
the origin in the $xy$ plane, thereby verifying that this Killing field is the
infinitesimal generator of rotations about the $z$ axis. (The rotations about
the $x$ and $y$ axes are obtained similarly, from $\alpha_{23} = 1$ and $\alpha_{31} = 1$,
respectively.) Remark: The solutions of Killing's equation give all the
continuous symmetries of Euclidean space, namely translations and
rotations. There are additional discrete symmetries, namely the reflections
$x^i \to -x^i$. The set of all these symmetries is called the Euclidean group
and usually denoted $E(n)$, where $n$ is the dimension of the Euclidean
space. It is a noncompact Lie group.

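(c) Show that, in spherical polar coordinates, the three (normalized)
rotational Killing fields of Euclidean space about the $x$, $y$, and $z$ axes are,
respectively,

$$ \xi_{(1)} = -\sin\phi\, \frac{\partial}{\partial\theta} - \cot\theta \cos\phi\, \frac{\partial}{\partial\phi}, $$

$$ \xi_{(2)} = \cos\phi\, \frac{\partial}{\partial\theta} - \cot\theta \sin\phi\, \frac{\partial}{\partial\phi}, $$

$$ \xi_{(3)} = \frac{\partial}{\partial\phi}. $$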
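The claim in part (b) of Exercise 8.38 can be checked numerically before it is proved. The following is a minimal sketch, not part of the original exercise: it assumes Cartesian coordinates (so that $\xi^i = \xi_i$), uses a hand-written classical Runge-Kutta step, and the names `killing_field`, `flow`, `ALPHA_Z`, and `NO_SHIFT` are illustrative choices.

```python
import math

def killing_field(x, alpha, a):
    """Components xi_i = alpha_ij x^j + a_i at the Cartesian point x."""
    return [sum(alpha[i][j] * x[j] for j in range(3)) + a[i] for i in range(3)]

def flow(x0, alpha, a, t_final, steps=2000):
    """Follow the integral curve dx/dt = xi(x) from x0 with classical RK4."""
    h = t_final / steps
    x = [float(c) for c in x0]
    for _ in range(steps):
        k1 = killing_field(x, alpha, a)
        k2 = killing_field([x[i] + 0.5 * h * k1[i] for i in range(3)], alpha, a)
        k3 = killing_field([x[i] + 0.5 * h * k2[i] for i in range(3)], alpha, a)
        k4 = killing_field([x[i] + h * k3[i] for i in range(3)], alpha, a)
        x = [x[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6 for i in range(3)]
    return x

# Generator with alpha_12 = -alpha_21 = 1 and a_i = 0 (indices shifted to 0-based).
ALPHA_Z = [[0.0, 1.0, 0.0], [-1.0, 0.0, 0.0], [0.0, 0.0, 0.0]]
NO_SHIFT = [0.0, 0.0, 0.0]

if __name__ == "__main__":
    start = [1.0, 0.0, 0.5]
    quarter = flow(start, ALPHA_Z, NO_SHIFT, math.pi / 2)
    full = flow(start, ALPHA_Z, NO_SHIFT, 2 * math.pi)
    print("radius in the xy plane after a quarter period:", math.hypot(quarter[0], quarter[1]))
    print("point after a full period:", full)
```

The distance from the $z$ axis and the height $z$ stay fixed along the curve, and the curve closes after parameter time $2\pi$, which is the behaviour expected of a rotation about the $z$ axis.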



8.39 Electric and magnetic monopoles in spherically symmetric spacetimes
A spherically symmetric spacetime (four-dimensional Lorentzian manifold)
admits coordinates $(\tau, \rho, \theta, \phi)$ in which the metric takes the form 36

$$ ds^2 = -a^2\, d\tau^2 + b^2\, d\rho^2 + r^2 (d\theta^2 + \sin^2\theta\, d\phi^2), $$

where $a = a(\tau, \rho)$, $b = b(\tau, \rho)$, and $r = r(\tau, \rho)$ are undetermined functions.

36 See e.g. [39], Appendix B.

(a) Let $F$ be a 2-form. Show that the condition that $F$ be invariant under
rotations implies that it can be written as

$$ F = A\, d\tau \wedge d\rho + B \sin\theta\, d\theta \wedge d\phi, $$

for some functions $A = A(\tau, \rho)$ and $B = B(\tau, \rho)$. Hint: Set $\mathcal{L}_{\xi_{(j)}} F = 0$
for $j = 1, 2, 3$, where the $\xi_{(j)}$ are the Killing fields from Exercise 8.38(c).
The coordinates $\theta$ and $\phi$ in that exercise and in this one may be taken to
be the same because they define two-dimensional spherically symmetric
integral submanifolds in both spaces. Lastly, by convention,
$F = \tfrac{1}{2} F_{ij}\, dx^i \wedge dx^j$.

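(b) Show that

$$ \star F = \frac{B a b}{r^2}\, d\tau \wedge d\rho + \frac{A r^2 \sin\theta}{a b}\, d\theta \wedge d\phi. $$

Hint: Use (8.91) together with $\sigma = \sqrt{|G|}\, d\tau \wedge d\rho \wedge d\theta \wedge d\phi$. Guess
$\star(d\tau \wedge d\rho) = \alpha\, d\theta \wedge d\phi$ and compute $\alpha$, etc.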

(c) Assume now that $F$ is the electromagnetic 2-form. Show that Maxwell's
equations $dF = d{\star}F = 0$ imply that

$$ (F_{\hat\imath\hat\jmath}) = \begin{pmatrix} 0 & -Q_e/r^2 & 0 & 0 \\ Q_e/r^2 & 0 & 0 & 0 \\ 0 & 0 & 0 & Q_m/r^2 \\ 0 & 0 & -Q_m/r^2 & 0 \end{pmatrix}, $$

where $F = \tfrac{1}{2} F_{\hat\imath\hat\jmath}\, \theta^{\hat\imath} \wedge \theta^{\hat\jmath}$, $\{\theta^{\hat\imath}\}$ is an orthonormal coframe, and $Q_e$ and $Q_m$
are constants. Remark: $F$ represents the electromagnetic field at $r \neq 0$
due to a point source (i.e., a monopole) with electric charge $Q_e$ and
magnetic charge $Q_m$ located at the origin of coordinates.

8.40 Schwarzschild geometry In 1916, shortly after Einstein created his general
theory of relativity, Karl Schwarzschild discovered a solution (i.e., a metric)
describing an isolated nonrotating black hole. In terms of the coordinates $t$,
$r$, $\theta$, and $\phi$ (corresponding roughly to time and the usual three-dimensional
spherical polar coordinates), the Schwarzschild line element is

$$ ds^2 = -\Phi(r)\, dt^2 + \Phi(r)^{-1}\, dr^2 + r^2 (d\theta^2 + \sin^2\theta\, d\phi^2), $$

where

$$ \Phi(r) := 1 - \frac{2m}{r}. $$

(a) Show that the following 1-forms constitute a dual orthonormal frame
field:

$$ \theta^{\hat 0} = \Phi^{1/2}\, dt, \quad \theta^{\hat 1} = \Phi^{-1/2}\, dr, \quad \theta^{\hat 2} = r\, d\theta, \quad \text{and} \quad \theta^{\hat 3} = r \sin\theta\, d\phi. $$

Hint: In an orthonormal frame the (Minkowski) metric components are
$-g_{\hat 0 \hat 0} = -g^{\hat 0 \hat 0} = g_{\hat\imath\hat\imath} = g^{\hat\imath\hat\imath} = 1$ for $i = 1, 2, 3$.

(b) Show that the Levi-Civita connection forms in the orthonormal basis
of part (a) satisfy $\omega^{\hat 0}{}_{\hat 0} = 0$, $\omega^{\hat 0}{}_{\hat\imath} = \omega^{\hat\imath}{}_{\hat 0}$, and $\omega^{\hat\imath}{}_{\hat\jmath} = -\omega^{\hat\jmath}{}_{\hat\imath}$, where
$i, j = 1, 2, 3$. Hint: Exercise 8.6 implies that $\omega_{\hat\mu\hat\nu} = -\omega_{\hat\nu\hat\mu}$, where
$\mu, \nu = 0, 1, 2, 3$.

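(c) Because the torsion vanishes, we have

$$ d\theta^{\hat a} = -\omega^{\hat a}{}_{\hat b} \wedge \theta^{\hat b}. $$

Show that $\omega^{\hat 0}{}_{\hat 1} = \omega^{\hat 1}{}_{\hat 0} = (m/r^2)\, dt$. Hint: Evaluate $d\theta^{\hat 0}$ and guess the
connection form. Note that there is some ambiguity in this procedure,
which can only be removed by computing all the connection forms.

For reference, we record the remaining nonzero matrix elements here:

$$ \omega^{\hat 1}{}_{\hat 2} = -\omega^{\hat 2}{}_{\hat 1} = -\Phi^{1/2}\, d\theta, $$
$$ \omega^{\hat 1}{}_{\hat 3} = -\omega^{\hat 3}{}_{\hat 1} = -\Phi^{1/2} \sin\theta\, d\phi, $$
$$ \omega^{\hat 2}{}_{\hat 3} = -\omega^{\hat 3}{}_{\hat 2} = -\cos\theta\, d\phi. $$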

(d) Use the matrix of connection 1-forms to compute the curvature forms $\Omega^{\hat 0}{}_{\hat 1}$
and [...] in the orthonormal frame field basis.

8.41 Parallel transport on the 2-sphere Let $\nabla$ be the Levi-Civita connection
on the 2-sphere, and let $\gamma(t)$ be a curve that traces out a latitude $\theta = \theta_0$.

(a) Show that the map that parallel transports vectors around $\gamma$ is given
explicitly by

$$ \vartheta(\gamma, \nabla) = \begin{pmatrix} \cos(\phi \cos\theta_0) & \sin\theta_0\, \sin(\phi \cos\theta_0) \\ -\sin(\phi \cos\theta_0)/\sin\theta_0 & \cos(\phi \cos\theta_0) \end{pmatrix}. $$

Hint: Solve the parallel transport equation (8.63). You will need to use
the results of Exercise 8.5. Assume the initial condition $Y(\theta_0, 0) = Y_0$.

(b) Show that a vector parallel transported around $\gamma$ turns through an angle

$$ \beta = 2\pi \cos\theta_0. $$

Hint: Recall that the cosine of the angle between two vectors $X$ and $Y$ is
$g(X, Y)/\sqrt{g(X, X)\, g(Y, Y)}$.

Remark: In flat space, a vector that is parallel transported around a circle
turns through an angle of $2\pi$. (Why?) The difference $2\pi(1 - \cos\theta_0)$ is
called the spherical excess associated with the curve $\gamma$, and it is a measure
of the total curvature enclosed. In two dimensions the (intrinsic) curvature
of a surface at a point can be defined by

$$ \text{curvature} = \lim_{A \to 0} \frac{1}{A}\, (\text{spherical excess}), $$

where $A$ is the area enclosed by the curve. For the sphere, the area
enclosed by the latitude is $2\pi R^2 (1 - \cos\theta_0)$, so the curvature of the sphere
is just $1/R^2$.
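The transport matrix and the turning angle of Exercise 8.41 lend themselves to a quick numerical check. The sketch below rests on assumptions that are not in the text: it takes the unit sphere with metric $d\theta^2 + \sin^2\theta\, d\phi^2$, parameterizes the latitude by $\phi$ itself, and writes the transport equation for the coordinate components as $dY^\theta/d\phi = \sin\theta_0 \cos\theta_0\, Y^\phi$, $dY^\phi/d\phi = -\cot\theta_0\, Y^\theta$. The function names are illustrative.

```python
import math

def transport_closed_form(Y0, theta0, phi):
    """Closed-form transport of the coordinate components (Y^theta, Y^phi) through azimuth phi."""
    w = phi * math.cos(theta0)
    s = math.sin(theta0)
    return (math.cos(w) * Y0[0] + s * math.sin(w) * Y0[1],
            -math.sin(w) / s * Y0[0] + math.cos(w) * Y0[1])

def transport_numeric(Y0, theta0, phi_final, steps=4000):
    """RK4 integration of the parallel transport equations along the latitude theta = theta0."""
    sc = math.sin(theta0) * math.cos(theta0)
    cot = math.cos(theta0) / math.sin(theta0)

    def rhs(Y):
        return (sc * Y[1], -cot * Y[0])

    h = phi_final / steps
    Y = (float(Y0[0]), float(Y0[1]))
    for _ in range(steps):
        k1 = rhs(Y)
        k2 = rhs((Y[0] + 0.5 * h * k1[0], Y[1] + 0.5 * h * k1[1]))
        k3 = rhs((Y[0] + 0.5 * h * k2[0], Y[1] + 0.5 * h * k2[1]))
        k4 = rhs((Y[0] + h * k3[0], Y[1] + h * k3[1]))
        Y = (Y[0] + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6,
             Y[1] + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6)
    return Y

def angle_between(X, Y, theta0):
    """Angle between tangent vectors at latitude theta0, using g = diag(1, sin^2 theta0)."""
    s2 = math.sin(theta0) ** 2

    def dot(U, V):
        return U[0] * V[0] + s2 * U[1] * V[1]

    c = dot(X, Y) / math.sqrt(dot(X, X) * dot(Y, Y))
    return math.acos(max(-1.0, min(1.0, c)))

if __name__ == "__main__":
    theta0, Y0 = 1.4, (1.0, 0.3)
    Y_end = transport_numeric(Y0, theta0, 2 * math.pi)
    print("numeric :", Y_end)
    print("closed  :", transport_closed_form(Y0, theta0, 2 * math.pi))
    print("angle   :", angle_between(Y0, Y_end, theta0), "vs", 2 * math.pi * math.cos(theta0))
```

Because `math.acos` returns values in $[0, \pi]$, the angle comparison is only meaningful for latitudes with $2\pi \cos\theta_0 < \pi$, which is why the example uses a latitude close to the equator.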

8.42 Geodesics on the 2-sphere Compute the geodesics on the sphere and show
that their images are great circles. Hint: A few tricks are needed for this
exercise. From the results of Exercise 8.5 one derives the (coupled, second-order)
geodesic equations

$$ \ddot\theta - (\sin\theta \cos\theta)\, \dot\phi^2 = 0, \tag{8.113} $$

$$ \ddot\phi + 2 (\cot\theta)\, \dot\phi\, \dot\theta = 0. \tag{8.114} $$

However, these are difficult to solve directly. Instead we seek so-called “first
integrals” of the motion to reduce them to first-order equations. Show that
(8.114) can be integrated to yield

$$ (\sin^2\theta)\, \dot\phi = J = \text{constant}. \tag{8.115} $$

Substitute this into (8.113) and integrate to get

$$ \dot\theta^2 + J^2 \csc^2\theta = a^2 = \text{constant}. \tag{8.116} $$

Now integrate (8.116) and plug the result back into (8.115); then integrate
again to get

$$ \cos\theta = -\sqrt{1 - (J/a)^2}\, \sin(at + c) \tag{8.117} $$

and

$$ \tan(\phi - \phi_0) = (J/a) \tan(at + c). \tag{8.118} $$

For the shape of the curve, substitute (8.118) into (8.117) to obtain

$$ \cot\theta = C \sin(\phi - \phi_0), \tag{8.119} $$

where $C := -\sqrt{(a/J)^2 - 1}$. Identifying the resulting curves may be easier if
you transform the solution back to Cartesian coordinates. Of course, if you are
willing to wave your hands a bit you could just observe that if $\phi$ = constant
then (8.115) gives $J = 0$, whereupon (8.116) gives $\dot\theta = a$, or $\theta = at + c$.
This is a great circle (a longitude line), so by symmetry all the geodesics are
great circles.
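Before working through the integrations, it can be reassuring to watch the first integrals (8.115) and (8.116) stay constant numerically. The following sketch is only an illustration under stated assumptions: it integrates (8.113) and (8.114) on the unit sphere with a hand-written RK4 step, and it tests the great-circle property by checking that the embedded curve stays orthogonal to the fixed vector $r \times v$ formed from the initial position and velocity in $\mathbb{R}^3$. All function names are hypothetical.

```python
import math

def geodesic_rhs(s):
    """Right-hand side of (8.113)-(8.114) for the state s = (theta, phi, theta_dot, phi_dot)."""
    th, ph, thd, phd = s
    return (thd, phd,
            math.sin(th) * math.cos(th) * phd ** 2,
            -2.0 * math.cos(th) / math.sin(th) * thd * phd)

def rk4_step(f, s, h):
    """One classical Runge-Kutta step of size h."""
    n = len(s)
    k1 = f(s)
    k2 = f(tuple(s[i] + 0.5 * h * k1[i] for i in range(n)))
    k3 = f(tuple(s[i] + 0.5 * h * k2[i] for i in range(n)))
    k4 = f(tuple(s[i] + h * k3[i] for i in range(n)))
    return tuple(s[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6 for i in range(n))

def first_integrals(s):
    """Return (J, a^2) as defined in (8.115) and (8.116)."""
    th, _, thd, phd = s
    J = math.sin(th) ** 2 * phd
    return J, thd ** 2 + J ** 2 / math.sin(th) ** 2

def embed(th, ph):
    """Point of the unit sphere in R^3."""
    return (math.sin(th) * math.cos(ph), math.sin(th) * math.sin(ph), math.cos(th))

def plane_normal(s):
    """Cross product of position and velocity in R^3 at the state s."""
    th, ph, thd, phd = s
    r = embed(th, ph)
    v = (thd * math.cos(th) * math.cos(ph) - phd * math.sin(th) * math.sin(ph),
         thd * math.cos(th) * math.sin(ph) + phd * math.sin(th) * math.cos(ph),
         -thd * math.sin(th))
    return (r[1] * v[2] - r[2] * v[1], r[2] * v[0] - r[0] * v[2], r[0] * v[1] - r[1] * v[0])

def follow_geodesic(s0, t_final=10.0, steps=5000):
    """Integrate the geodesic and return (largest |n . r| seen, final state)."""
    n = plane_normal(s0)
    h = t_final / steps
    s = tuple(float(c) for c in s0)
    worst = 0.0
    for _ in range(steps):
        s = rk4_step(geodesic_rhs, s, h)
        r = embed(s[0], s[1])
        worst = max(worst, abs(sum(n[i] * r[i] for i in range(3))))
    return worst, s

if __name__ == "__main__":
    s0 = (1.0, 0.0, 0.3, 0.8)
    worst, s_end = follow_geodesic(s0)
    print("first integrals at start:", first_integrals(s0))
    print("first integrals at end  :", first_integrals(s_end))
    print("largest deviation from the plane through the origin:", worst)
```

A curve on the sphere that stays in a fixed plane through the origin is a great circle, so a vanishing deviation is the numerical counterpart of (8.119).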

8.43 Constant-curvature manifolds Let $M$ be a Riemannian manifold, and let
$\Pi$ be a two-dimensional subspace of $T_pM$ (a “2-plane” of $M$). The sectional
curvature $K(\Pi)$ associated to the 2-plane $\Pi$ is

$$ K(\Pi) = \frac{g(X, R(X, Y)Y)}{g(X, X)\, g(Y, Y) - g(X, Y)^2}, \tag{8.120} $$

where $X$ and $Y$ are any two tangent vectors spanning $\Pi$. 37 One can think of
the sectional curvature as a kind of probe of the curvature at every point. Just
as one can reconstruct a three-dimensional object from the two-dimensional
sections of a computerized tomographical scan, so too can one recover the
full Riemann curvature tensor at any point from its two-dimensional sectional
curvatures.

(a) Show that $K(\Pi)$ does not depend on the choice of spanning vectors.
Hint: Replace $X$ and $Y$ by linear combinations of $X$ and $Y$. Simplify
the problem by invoking some of the symmetry properties (8.47)-(8.50).

(b) Prove a uniqueness theorem: The Riemann curvature tensor of $M$ at $p$
is determined uniquely if one knows the sectional curvature of every
2-plane of $M$ through $p$. Hint: The proof is purely algebraic. Suppose
that $R$ and $R'$ are two curvature tensors satisfying (8.120). Then
$g(X, R(X, Y)Y) = g(X, R'(X, Y)Y)$. Send $X$ to $X + W$ in this equation
and use the symmetry properties of the Riemann tensor to show that
$g(W, R(X, Y)Y) = g(W, R'(X, Y)Y)$. Use a similar trick (and (8.49)) to
obtain $g(W, R(X, Y)Z) = g(W, R'(X, Y)Z)$ for arbitrary vectors $X$, $Y$,
$Z$, and $W$, and thereby conclude that $R = R'$.

(c) If $K(\Pi)$ is independent of the particular 2-plane $\Pi$ at $p$ then $M$ is
isotropic at $p$, and we write $K_p$ for $K(\Pi)$. The manifold $M$ is isotropic
if it is isotropic for all $p \in M$. Show that $M$ is isotropic if and only if

$$ R_{ijk\ell} = K_p\, (g_{ik} g_{j\ell} - g_{i\ell} g_{jk}) \tag{8.121} $$

37 The vectors $X$ and $Y$ are linearly independent, so by the Cauchy-Schwarz inequality the denominator
is strictly positive.

in local coordinates about any point $p$. Hint: One direction is easy. For
the other, define a tensor $R'$ by

$$ g(W, R'(X, Y)Z) := g(X, W)\, g(Y, Z) - g(X, Z)\, g(Y, W) $$

and use (8.120) together with the result of part (b) to show that
$R = K_p R'$.

(d) Choose an orthonormal coframe field $\{\theta^a\}$ on a neighborhood of $p$. Show
that (8.121) is equivalent to the statement that

$$ \Omega^{ab} = K_p\, \theta^a \wedge \theta^b, \tag{8.122} $$

where $\Omega$ is the curvature 2-form.

(e) If $K_p =: K$ is constant (i.e., independent of $p$), $M$ is said to have
constant curvature. Prove Schur's theorem: If $M$ is a connected, isotropic
Riemannian manifold of dimension at least 3 then $M$ has constant
curvature. In other words, in higher-dimensional manifolds isotropy forces
the sectional curvature to be constant. Hint: Take the exterior derivative
of (8.122) and use the Bianchi identity and the vanishing of the torsion
form to show that $dK \wedge \theta^a \wedge \theta^b = 0$ for $1 \le a, b \le n$. (You may also
want to use the result of Exercise 8.6.)

8.44 Gaussian curvature If M is two dimensional then there is only a single 
2-plane through any point, so the isotropy condition of Exercise 8.43 is trivial 
and Schur's theorem is inapplicable. Instead, the sectional curvature $K_p :=
K(T_pM)$, now called the Gaussian curvature of $M$, can and usually does
vary from point to point. From the definition (8.120) we see that the Gaussian 
curvature contains all the curvature information of M, because there is only 
one linearly independent component of the Riemann tensor and K is basically 
this component. Show that, in fact, the Gaussian curvature can be expressed 
as one-half the Ricci scalar curvature. 

8.45 Hyperbolic geometry It is a classical fact (e.g. [23], Chapter 8) that if
$M$ is a complete, 38 connected and simply connected manifold with constant
curvature then it can only be one of three types: (i) a sphere, (ii) Euclidean
space, or (iii) hyperbolic space. The purpose of this problem is to investigate
the last space.

There are several models of hyperbolic space; here we use the Poincaré
upper half-plane model, which consists of the upper half-plane $\mathbb{H}^2_+ :=
\{(x, y) \in \mathbb{R}^2 : y > 0\}$ equipped with the line element

$$ ds^2 = \frac{dx^2 + dy^2}{y^2}. \tag{8.123} $$

38 A (pseudo)-Riemannian manifold is (geodesically) complete if, for any vector $X_p$, the exponential
map $\operatorname{Exp} tX_p$ (defined in Appendix D) exists for all $t \in \mathbb{R}$. Intuitively, you can walk forever along a
geodesic and not worry about falling off the manifold.


(a) Show that all the points on the $x$ axis are at an infinite distance from the
points off the axis.

(b) Compute all the Christoffel symbols. (Here's one: $\Gamma^x{}_{xy} = -1/y$.)

(c) Write down and solve the geodesic equations for this metric. What do the
curves look like geometrically? Hint: The geodesic equations are

$$ y \ddot{x} - 2 \dot{x} \dot{y} = 0, $$
$$ y \ddot{y} + \dot{x}^2 - \dot{y}^2 = 0. $$

The first equation is separable and yields $\dot{x} = c y^2$ for some constant $c$.
Set $f := \dot{y}/y$. If $c = 0$ then the second equation gives $\dot{f} = 0$, which you
can easily solve. If $c \neq 0$ then the second equation becomes $\dot{f} = -c \dot{x}$,
which can be integrated to give $x \dot{x} + y \dot{y} = a \dot{x}$. Integrate again to get the
second class of geodesics.

(d) Show that the Poincaré upper half-plane has constant (Gaussian)
curvature $K = -1$.
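The first integral $x \dot{x} + y \dot{y} = a \dot{x}$ in part (c) of Exercise 8.45 says that $(x - a)^2 + y^2$ is constant, which can be watched numerically. The sketch below is an illustration, not the book's method: it solves the two geodesic equations for $\ddot{x}$ and $\ddot{y}$, integrates them with RK4, and compares the circle centred on the $x$ axis that is tangent to the velocity at the start and at the end. It assumes $\dot{x} \neq 0$ along the way, and the function names are invented for the example.

```python
import math

def geodesic_rhs(s):
    """Upper half-plane geodesic equations solved for the second derivatives."""
    x, y, xd, yd = s
    return (xd, yd, 2.0 * xd * yd / y, (yd * yd - xd * xd) / y)

def integrate(s0, t_final, steps=4000):
    """Integrate the state s = (x, y, x_dot, y_dot) with classical RK4."""
    h = t_final / steps
    s = tuple(float(c) for c in s0)
    for _ in range(steps):
        k1 = geodesic_rhs(s)
        k2 = geodesic_rhs(tuple(s[i] + 0.5 * h * k1[i] for i in range(4)))
        k3 = geodesic_rhs(tuple(s[i] + 0.5 * h * k2[i] for i in range(4)))
        k4 = geodesic_rhs(tuple(s[i] + h * k3[i] for i in range(4)))
        s = tuple(s[i] + h * (k1[i] + 2 * k2[i] + 2 * k3[i] + k4[i]) / 6 for i in range(4))
    return s

def circle_through(s):
    """Centre on the x axis and squared radius of the circle tangent to the velocity (needs x_dot != 0)."""
    x, y, xd, yd = s
    center = x + y * yd / xd
    return center, (x - center) ** 2 + y ** 2

def speed_squared(s):
    """Squared speed measured with ds^2 = (dx^2 + dy^2) / y^2."""
    x, y, xd, yd = s
    return (xd * xd + yd * yd) / (y * y)

if __name__ == "__main__":
    s0 = (0.0, 1.0, 1.0, 0.5)
    s1 = integrate(s0, 2.0)
    print("circle at start:", circle_through(s0))
    print("circle at end  :", circle_through(s1))
    print("speed squared  :", speed_squared(s0), speed_squared(s1))
```

Both the circle and the hyperbolic speed are preserved, consistent with the geodesics in the $c \neq 0$ class being arcs of circles centred on the $x$ axis traversed at constant speed.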


Remark: Hyperbolic geometry has great historical significance. Two thou- 
sand three hundred years ago Euclid set down his axioms of geometry in 
the form of five postulates. All the postulates seemed self-evident except the 
fifth, which states that, for every straight line L and point P not on L, there 
is a unique line L' through P and parallel to L. For many centuries, people 
tried to prove the fifth postulate from the other four, but failed. The reason 
for their failure, which surprised almost everyone, was that there are other 
perfectly consistent geometries in which the first four axioms hold but the 
fifth does not, a fact discovered around the same time by Bolyai, Gauss, and 
Lobachevsky. Gauss observed that, in spherical geometry, given a “straight 
line” (meaning a geodesic or great circle) and a point P not on that “line”, 
every “line” through P meets the original “line”. Bolyai and Lobachevsky 
discovered hyperbolic geometry, in which there are an infinite number of 
“straight” lines (geodesics) through a fixed point P that do not meet a given 
“line”. Beltrami supplied several models of hyperbolic geometry, including 
the Poincaré upper half-plane.


8.46 Möbius transformations Let $A = \begin{pmatrix} a & b \\ c & d \end{pmatrix} \in GL(2, \mathbb{C})$, the Lie group of
$2 \times 2$ complex matrices with nonzero determinant. The fractional linear map
$T_A : \mathbb{C} \to \mathbb{C}$ given by

$$ T_A(z) = \frac{az + b}{cz + d} $$

is called a Möbius transformation. It has all sorts of surprising properties
and plays an important role in geometry and number theory.

(a) Show that the set $\{T_A\}$ of all Möbius transformations forms a group under
composition called the Möbius group, denoted here by $\mathcal{M}$.

(b) Show that the map $A \mapsto T_A$ is a homomorphism of $GL(2, \mathbb{C})$ into $\mathcal{M}$
with kernel consisting of all nonzero complex multiples of the identity
matrix. Remark: By the first isomorphism theorem of group theory this
shows that $\mathcal{M} \cong GL(2, \mathbb{C})/\{\lambda I\} = PGL(2, \mathbb{C})$, the projective general
linear group of $2 \times 2$ complex matrices.

(c) Let $\mathcal{M}'$ denote the subgroup of Möbius transformations for which $A \in
SL(2, \mathbb{R})$. 39 Show that $T_A \in \mathcal{M}'$ is a diffeomorphism from the upper half
plane $\mathbb{H}^2_+$ to itself (where we make the usual identification of $\mathbb{R}^2$ and $\mathbb{C}$
via $z = x + iy$).

(d) Show that the elements of $\mathcal{M}'$ are isometries of hyperbolic geometry, by
showing that $T_A^* ds^2 = ds^2$ where $ds^2$ is the hyperbolic metric (8.123).
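Several parts of Exercise 8.46 can be sanity-checked with complex arithmetic before attempting the proofs. The sketch below is illustrative only. It represents $A$ as a nested tuple, and it relies on two standard facts that the exercise does not state: for real entries with unit determinant, $\operatorname{Im} T_A(z) = \operatorname{Im} z / |cz + d|^2$, and the distance for the metric (8.123) satisfies $\cosh d(z, w) = 1 + |z - w|^2 / (2 \operatorname{Im} z \operatorname{Im} w)$. The function names are hypothetical.

```python
import math

def mobius(A, z):
    """T_A(z) = (a z + b) / (c z + d) for A = ((a, b), (c, d))."""
    (a, b), (c, d) = A
    return (a * z + b) / (c * z + d)

def matmul(A, B):
    """Product of two 2 x 2 matrices stored as nested tuples."""
    return tuple(tuple(sum(A[i][k] * B[k][j] for k in range(2)) for j in range(2))
                 for i in range(2))

def hyperbolic_distance(z, w):
    """Distance in the upper half-plane with ds^2 = (dx^2 + dy^2) / y^2."""
    return math.acosh(1.0 + abs(z - w) ** 2 / (2.0 * z.imag * w.imag))

if __name__ == "__main__":
    A = ((2.0, 1.0), (3.0, 2.0))    # real entries, determinant 1
    B = ((1.0, -1.0), (1.0, 0.0))   # real entries, determinant 1
    z, w = complex(0.3, 1.2), complex(-1.0, 0.4)
    print("composition:", mobius(A, mobius(B, z)), "vs", mobius(matmul(A, B), z))
    print("imaginary part stays positive:", mobius(A, z).imag)
    print("distance before and after:", hyperbolic_distance(z, w),
          hyperbolic_distance(mobius(A, z), mobius(A, w)))
```

Agreement of the two distances for sample points is of course weaker than the pullback statement $T_A^* ds^2 = ds^2$, but it is a useful check on the algebra.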

8.47 Induced metrics Let $f : M \to N$ be a smooth immersion of a manifold $M$
into a geometric manifold $N$ equipped with a metric $g$. Then $M$ is naturally
a geometric manifold when equipped with the induced metric $f^*g$. Suppose
$\Sigma$ to be a two-dimensional surface in $\mathbb{R}^3$ given by the parameterization

$$ \sigma : (u, v) \mapsto (x(u, v), y(u, v), z(u, v)), $$

and let $h$ be the induced metric on $\Sigma$. Define

$$ \sigma_u := \left( \frac{\partial x}{\partial u}, \frac{\partial y}{\partial u}, \frac{\partial z}{\partial u} \right) $$

and

$$ \sigma_v := \left( \frac{\partial x}{\partial v}, \frac{\partial y}{\partial v}, \frac{\partial z}{\partial v} \right). $$

(a) Show that we can write

$$ h = E\, du \otimes du + F\, du \otimes dv + F\, dv \otimes du + G\, dv \otimes dv = E\, du^2 + 2F\, du\, dv + G\, dv^2, $$

where

$$ E := g(\sigma_u, \sigma_u), \quad F := g(\sigma_u, \sigma_v), \quad \text{and} \quad G := g(\sigma_v, \sigma_v). $$

39 The special linear group $SL(n, \mathbb{R})$ consists of all $n \times n$ real matrices with unit determinant.

Hint: Identify $f$ with the parameterization map. Write

$$ g = dx^2 + dy^2 + dz^2 = dx \otimes dx + dy \otimes dy + dz \otimes dz $$

and use the fact that $f^*(\mu \otimes \nu) = f^*\mu \otimes f^*\nu$. Remark: For more fun
with immersions, see Appendix H.

(b) The usual parameterization of the sphere in terms of spherical polar
coordinates is

$$ \sigma(\theta, \phi) = (\sin\theta \cos\phi, \sin\theta \sin\phi, \cos\theta), \quad 0 \le \theta \le \pi, \quad 0 \le \phi < 2\pi. $$

(Yes, the parameterization is not well behaved at the poles - just ignore
those points.) Compute the induced metric tensor on $S^2$ in spherical polar
coordinates and verify that it agrees with the metric given in Exercise 8.5.
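The coefficients $E$, $F$, and $G$ of Exercise 8.47 can be estimated for any parameterization by differentiating numerically, which gives an independent check on a hand computation. The sketch below is an illustration under assumptions: it approximates the tangent vectors by central differences with a hypothetical step `h`, takes $g$ to be the Euclidean dot product, and includes the catenoid parameterization of Exercise 8.48 as a second test surface with an arbitrary default `a = 2.0`.

```python
import math

def first_fundamental_form(sigma, u, v, h=1e-5):
    """Estimate (E, F, G) at (u, v) from central differences of the parameterization sigma."""
    su = [(p - m) / (2 * h) for p, m in zip(sigma(u + h, v), sigma(u - h, v))]
    sv = [(p - m) / (2 * h) for p, m in zip(sigma(u, v + h), sigma(u, v - h))]

    def dot(a, b):
        return sum(x * y for x, y in zip(a, b))

    return dot(su, su), dot(su, sv), dot(sv, sv)

def sphere(theta, phi):
    """Unit sphere in spherical polar coordinates."""
    return (math.sin(theta) * math.cos(phi), math.sin(theta) * math.sin(phi), math.cos(theta))

def catenoid(u, v, a=2.0):
    """Catenoid (a cosh v cos u, a cosh v sin u, a v); the value of a is an arbitrary choice."""
    return (a * math.cosh(v) * math.cos(u), a * math.cosh(v) * math.sin(u), a * v)

if __name__ == "__main__":
    print("sphere  :", first_fundamental_form(sphere, 0.7, 1.1))
    print("expected:", (1.0, 0.0, math.sin(0.7) ** 2))
    print("catenoid:", first_fundamental_form(catenoid, 0.4, 0.5))
    print("expected:", (4.0 * math.cosh(0.5) ** 2, 0.0, 4.0 * math.cosh(0.5) ** 2))
```

For the sphere the estimates reproduce $d\theta^2 + \sin^2\theta\, d\phi^2$, and for the catenoid they show that the induced metric is conformal to the flat metric $du^2 + dv^2$.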

8.48 Gaussian curvature of the catenoid A catenoid is a two-dimensional
surface of revolution in $\mathbb{R}^3$ obtained by rotating a catenary about an appropriate
axis. One parameterization is given by

$$ \sigma(u, v) = (a \cosh v \cos u, a \cosh v \sin u, a v), \quad 0 \le u < 2\pi, \quad -\infty < v < \infty. $$

Show that, in these coordinates, the Gaussian curvature of the catenoid is
$K = -a^{-2} \cosh^{-4} v$. Hint: Find an orthonormal coframe field $\{\theta^1, \theta^2\}$ on
the surface and compute the curvature 2-form $\Omega$. Use (8.122). For another
method, see Appendix H.

8.49 Gaussian curvature of the standard torus One parameterization for the
standard embedding of the torus in $\mathbb{R}^3$ is given by

$$ x = (b + a \cos v) \cos u, $$
$$ y = (b + a \cos v) \sin u, $$
$$ z = a \sin v, $$

where $0 \le u, v < 2\pi$ and $a < b$. Calculate its Gaussian curvature in the
induced metric and verify that it is not identically zero. Hint: See the previous
hint.

8.50 Geometry of Lie groups (For readers who completed the problems on Lie
groups in Chapter 3 and Exercise 3.52 in particular.) If $G$ is a semisimple Lie
group then its Killing form is a nondegenerate inner product on the tangent
space to $G$ at the identity. This form can be extended naturally to a metric on
the whole of $G$, defined as follows:

$$ (X_g, Y_g)_g := (L_{g^{-1}*} X_g, L_{g^{-1}*} Y_g)_e, \tag{8.124} $$

for any vectors $X_g, Y_g \in T_g G$. In this way $G$ becomes a natural Riemannian
or pseudo-Riemannian manifold, depending on the signature of the Killing
form.

(a) In addition to the natural left action of $G$ on itself, given by $L_g h = gh$,
there is a natural right action given by $R_g h = hg$. By construction $L_g$
is an isometry, so we say the metric is left-invariant. 40 Show that the
metric is actually bi-invariant, meaning that $R_g$ is an isometry as well.
Hint: Note that $\operatorname{Ad} g = \varphi(g)_{*,e}$, where $\varphi(g) = L_g \circ R_{g^{-1}} = R_{g^{-1}} \circ L_g$.
Now use the result of Exercise 3.52(b).

(b) Let $\nabla$ be the Levi-Civita connection on $G$ determined by the Killing form
and let $X$ and $Y$ be left-invariant vector fields. Show that

$$ \nabla_X Y = \tfrac{1}{2} [X, Y]. \tag{8.125} $$

Hint: Let $Z$ be a left-invariant vector field. Consider $(\nabla_X Y, Z)$ and apply
the Koszul formula. Note that $(X, Y)$, $(Y, Z)$, and $(X, Z)$ are constant.
Why? Remark: The formula above implies that $\nabla_X X = 0$ for any
left-invariant vector field $X$. This means that the integral curves of $X$ are
geodesics. In particular, the geodesics through the identity are precisely
the one-parameter subgroups of $G$. 41

(c) In the next three parts of the exercise, let $X$, $Y$, and $Z$ be left invariant
vector fields. Show that

$$ R(X, Y)Z = -\tfrac{1}{4} [[X, Y], Z], \tag{8.126} $$

40 This is supposed to be obvious from the definition, but it may be difficult to see when using the
parenthetical notation for the metric (8.124). Instead, temporarily denote the metric by the letter $m$
(so as not to confuse it with the group element $g$), and let $B$ denote the Killing form. The statement
that $L_g$ is an isometry is just the statement that $L_g^* m = m$. To prove this, note that for any $h \in G$
and any vectors $X_h$ and $Y_h$,

$$ (L_g^* m)(X_h, Y_h)_h = m(L_{g*} X_h, L_{g*} Y_h)_{gh} \quad \text{(by (3.90))}, $$
$$ = B(L_{(gh)^{-1}*} L_{g*} X_h, L_{(gh)^{-1}*} L_{g*} Y_h) \quad \text{(by (8.124))}, $$
$$ = B(L_{h^{-1} g^{-1} *} L_{g*} X_h, L_{h^{-1} g^{-1} *} L_{g*} Y_h) $$
$$ = B(L_{h^{-1}*} X_h, L_{h^{-1}*} Y_h) \quad \text{(by the chain rule)}, $$
$$ = m(X_h, Y_h)_h \quad \text{(by (8.124))}. $$

(It is worth emphasizing that $X_h$ and $Y_h$ are just vectors, not specialized vector fields.)

41 These geodesics are given by the exponential map, which explains the use in Appendix D of the same
terminology for more general Riemannian manifolds.

where $R$ is the Riemann curvature tensor. Hint: Use part (b) and
the Jacobi identity. Remark: This formula completely determines the
Riemann curvature tensor, because we can always find a basis of
left-invariant vector fields on $G$.

(d) Show that the sectional curvature of $G$ is given by

$$ K(X, Y) = \frac{1}{4}\, \frac{([X, Y], [X, Y])}{(X, X)(Y, Y) - (X, Y)^2}. \tag{8.127} $$

(e) Show that the Ricci tensor equals the bi-invariant metric (8.124) up to a
constant. That is,

$$ \operatorname{Ric}(X, Y) = -\tfrac{1}{4} (X, Y). \tag{8.128} $$

A manifold for which the Ricci tensor is proportional to the metric tensor
is called an Einstein manifold, so semisimple Lie groups are Einstein
manifolds.

8.51 Standard metrics for constant curvature 3-manifolds Let $M$ be an
isotropic three-dimensional Riemannian manifold with local coordinates
$r, \theta, \phi$ and line element

$$ ds^2 = dr^2 + f^2(r) (d\theta^2 + \sin^2\theta\, d\phi^2), $$

where $f \to 0$ as $r \to 0$.

(a) Starting from the orthonormal coframe

$$ \theta^1 = dr, \quad \theta^2 = f(r)\, d\theta, \quad \theta^3 = f(r) \sin\theta\, d\phi, $$

compute the connection 1-forms $\omega^i{}_j$.

(b) Compute the curvature 2-forms $\Omega^i{}_j$.

(c) By Schur's theorem, $M$ must be a constant curvature manifold. By
rescaling coordinates if necessary we may take $K = 1$, $K = 0$, or $K = -1$.
Show that these cases correspond to the respective choices $f(r) = \sin r$,
$r$, and $\sinh r$. Remark: To get these forms you may have to flip the sign of
$r$, which is all right because $r \to -r$ turns out to be an isometry of $M$.

8.52 Pseudotensor densities and the Levi-Civita alternating symbol (In this
exercise a coordinate basis is assumed throughout, so for ease of exposition
we make no distinction between a tensor and its components. Also, we
assume that $\dim M = n$.) The Levi-Civita alternating symbol (or, epsilon
symbol) is defined by the following two properties: (i) $\varepsilon_{12 \dots n} = +1$ and
(ii) $\varepsilon_{i_1 \dots i_n}$ flips sign if any two indices are interchanged (total antisymmetry).
Equivalently, $\varepsilon_{i_1 \dots i_n}$ vanishes if any two indices are the same and equals the
sign of the permutation $(i_1 \dots i_n)$ otherwise.

(a) Show that, if $(A^i{}_j)$ is any $n \times n$ matrix,

$$ \varepsilon_{i_1 \dots i_n} A^{i_1}{}_{j_1} \cdots A^{i_n}{}_{j_n} = \varepsilon_{j_1 \dots j_n} \det A. \tag{8.129} $$


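(b) A multi-indexed object $T_{i_1 \dots i_n}$ is called a pseudotensor if, under a
change of coordinates, it transforms according to

$$ T_{i'_1 \dots i'_n} = (\operatorname{sgn} J)\, \frac{\partial x^{i_1}}{\partial y^{i'_1}} \cdots \frac{\partial x^{i_n}}{\partial y^{i'_n}}\, T_{i_1 \dots i_n}, \tag{8.130} $$

where $J := \det(\partial y^{i'}/\partial x^{i})$ is the Jacobian determinant. 42 It is called a
tensor density of weight $w$ if

$$ T_{i'_1 \dots i'_n} = |J|^{-w}\, \frac{\partial x^{i_1}}{\partial y^{i'_1}} \cdots \frac{\partial x^{i_n}}{\partial y^{i'_n}}\, T_{i_1 \dots i_n}, \tag{8.131} $$

where $| \cdot |$ means the absolute value. We extend the Levi-Civita symbol
to the entire manifold by demanding that it transform under a
coordinate change according to

$$ \varepsilon_{i'_1 \dots i'_n} = \xi\, \frac{\partial x^{i_1}}{\partial y^{i'_1}} \cdots \frac{\partial x^{i_n}}{\partial y^{i'_n}}\, \varepsilon_{i_1 \dots i_n}. \tag{8.132} $$

Show that $\xi = J$ and thereby show that $\varepsilon_{i_1 \dots i_n}$ is a pseudotensor density of
weight $-1$.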

(c) Let $G := \det g_{ij}$. Show that $G$ is a scalar density of weight 2, i.e., show
that

$$ G' = J^{-2} G. \tag{8.133} $$

(d) Define

$$ \epsilon_{i_1 \dots i_n} := \sqrt{|G|}\, \varepsilon_{i_1 \dots i_n}. \tag{8.134} $$

42 A pseudotensor is just like a regular tensor except that it suffers an additional sign change under an
orientation-reversing coordinate transformation. Note that on an oriented manifold we can choose
the Jacobian to be positive everywhere, so for most purposes the distinction between tensors and
pseudotensors is immaterial. The distinction becomes important in physics where one wishes to
consider the hypothetical possibility of a parity transformation (a global coordinate inversion sending
$x^i \to -x^i$). The reason is that certain physical quantities such as the position vector $\mathbf{r}$ or the velocity
vector $\mathbf{v}$ (so-called polar vectors) flip sign under a parity transformation whereas other quantities such
as the angular momentum vector $\mathbf{L}$ (a pseudovector or axial vector) do not (because $\mathbf{L} = m\mathbf{r} \times \mathbf{v}$).
The behavior of objects under parity transformations places constraints on the way in which they can
appear in the various laws of physics, especially if it is known that the corresponding phenomena are
(or are not) parity invariant. For example, if parity were conserved in a certain physical process then
a term such as $\mathbf{r} \cdot \mathbf{L}$ (a pseudoscalar) could not appear in the equation of motion as it would violate
parity conservation.

Show that this is a pseudotensor. Despite this fact, it is almost always
referred to as the Levi-Civita alternating tensor or epsilon tensor.

(e) Show that the volume element can be written in local coordinates as

$$ \sigma = \frac{1}{n!}\, \epsilon_{i_1 \dots i_n}\, dx^{i_1} \wedge \cdots \wedge dx^{i_n}. \tag{8.135} $$

(f) Define $\varepsilon^{i_1 \dots i_n}$, the contravariant cousin of $\varepsilon_{i_1 \dots i_n}$, by requiring that $\varepsilon^{i_1 \dots i_n}$ be
the sign of the permutation $(i_1, \dots, i_n)$ and that it transform according to

$$ \varepsilon^{i'_1 \dots i'_n} = \eta\, \frac{\partial y^{i'_1}}{\partial x^{i_1}} \cdots \frac{\partial y^{i'_n}}{\partial x^{i_n}}\, \varepsilon^{i_1 \dots i_n}. \tag{8.136} $$

Show that $\eta = J^{-1}$, so that $\varepsilon^{i_1 \dots i_n}$ is a pseudotensor density of weight $+1$.

(g) Now define the upstairs version of the Levi-Civita alternating tensor by

$$ \epsilon^{i_1 \dots i_n} := \pm \frac{1}{\sqrt{|G|}}\, \varepsilon^{i_1 \dots i_n}, \tag{8.137} $$

where the sign is chosen so that

$$ \epsilon^{i_1 \dots i_n} = g^{i_1 j_1} \cdots g^{i_n j_n}\, \epsilon_{j_1 \dots j_n}. \tag{8.138} $$

Show that the sign is $+1$ if the signature is Euclidean and $-1$ if the
signature is Lorentzian.

(h) Show that the covariant derivative of the Levi-Civita tensor is zero. 
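Identity (8.129) from part (a) of this exercise is easy to test by brute force for small $n$, which can help catch index-placement slips. The following sketch is illustrative only: indices run from 0 rather than 1, the matrix $A^i{}_j$ is stored as `A[i][j]`, and the helper names `levi_civita` and `contract_with_epsilon` are invented for the example.

```python
from itertools import product

def levi_civita(indices):
    """Sign of the permutation `indices` of (0, ..., n-1), or 0 if an index repeats."""
    idx = list(indices)
    n = len(idx)
    if len(set(idx)) != n:
        return 0
    sign = 1
    for i in range(n):
        # Swap entries into place; each transposition flips the sign.
        while idx[i] != i:
            j = idx[i]
            idx[i], idx[j] = idx[j], idx[i]
            sign = -sign
    return sign

def contract_with_epsilon(A, js):
    """Left-hand side of (8.129): eps_{i_1...i_n} A^{i_1}_{j_1} ... A^{i_n}_{j_n}."""
    n = len(A)
    total = 0
    for idx in product(range(n), repeat=n):
        term = levi_civita(idx)
        if term == 0:
            continue
        for i, j in zip(idx, js):
            term *= A[i][j]
        total += term
    return total

if __name__ == "__main__":
    A = [[2, -1, 0], [1, 3, 4], [0, 5, -2]]
    det_A = contract_with_epsilon(A, (0, 1, 2))
    print("det A from the epsilon contraction:", det_A)
    for js in [(0, 1, 2), (1, 0, 2), (2, 0, 1), (0, 0, 1)]:
        print(js, contract_with_epsilon(A, js), levi_civita(js) * det_A)
```

Choosing $j_1 \dots j_n = 1 \dots n$ recovers the Leibniz formula for the determinant, and every other choice of the $j$'s either permutes the columns or repeats one.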

8.53 Epsilon tensor identities Assume that $M$ is an $n$-dimensional manifold (so
that all indices run from 1 to $n$).

(a) Prove the following so-called epsilon symbol identities. For $0 \le k \le n$,

$$ \varepsilon^{i_1 \dots i_k i_{k+1} \dots i_n}\, \varepsilon_{j_1 \dots j_k i_{k+1} \dots i_n} = (n - k)!\, \delta^{i_1 \dots i_k}_{j_1 \dots j_k}, \tag{8.139} $$

where

$$ \delta^{i_1 \dots i_k}_{j_1 \dots j_k} = \det \begin{pmatrix} \delta^{i_1}_{j_1} & \cdots & \delta^{i_1}_{j_k} \\ \vdots & & \vdots \\ \delta^{i_k}_{j_1} & \cdots & \delta^{i_k}_{j_k} \end{pmatrix}. \tag{8.140} $$

In this formula $\delta^i_j$ is the usual Kronecker delta and $\delta^{i_1 \dots i_k}_{j_1 \dots j_k}$ is the
generalized Kronecker symbol. (By convention $i_k = j_k = 0$ for $k < 0$
or $k > n$.) Hint: First prove the result for $k = n$ by showing that
both sides have the same symmetry properties and that they agree when
$(i_1, \dots, i_n) = (j_1, \dots, j_n) = (1, \dots, n)$, then work down from $n$ to 0.

(b) What is the analogue of (8.139) for the epsilon tensor? Consider both the
Riemannian and Lorentzian cases.
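The contraction identity of part (a), in its standard form with $(n-k)!$ times the generalized Kronecker symbol on the right-hand side, can be verified exhaustively for small $n$. The sketch below is an illustration rather than a proof: indices are 0-based, the determinant in (8.140) is expanded with the Leibniz formula, and the function names are hypothetical.

```python
from itertools import permutations, product
from math import factorial

def levi_civita(indices):
    """Sign of the permutation `indices` of (0, ..., n-1), or 0 if an index repeats."""
    idx = list(indices)
    n = len(idx)
    if len(set(idx)) != n:
        return 0
    sign = 1
    for i in range(n):
        while idx[i] != i:
            j = idx[i]
            idx[i], idx[j] = idx[j], idx[i]
            sign = -sign
    return sign

def generalized_delta(upper, lower):
    """Determinant of the k x k matrix of Kronecker deltas (equal to 1 when k = 0)."""
    k = len(upper)
    total = 0
    for perm in permutations(range(k)):
        if all(upper[a] == lower[perm[a]] for a in range(k)):
            total += levi_civita(perm)
    return total

def contraction(upper, lower, n):
    """Sum over the n - k shared indices of eps^{upper, rest} * eps_{lower, rest}."""
    k = len(upper)
    return sum(levi_civita(tuple(upper) + rest) * levi_civita(tuple(lower) + rest)
               for rest in product(range(n), repeat=n - k))

def identity_holds(n, k):
    """Check the identity for every choice of the k free upper and lower indices."""
    return all(contraction(u, l, n) == factorial(n - k) * generalized_delta(u, l)
               for u in product(range(n), repeat=k)
               for l in product(range(n), repeat=k))

if __name__ == "__main__":
    for n in (2, 3, 4):
        print(n, [identity_holds(n, k) for k in range(n + 1)])
```

For $n = 3$ and $k = 2$ this reproduces the familiar rule $\varepsilon^{ijn} \varepsilon_{k\ell n} = \delta^i_k \delta^j_\ell - \delta^i_\ell \delta^j_k$.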

8.54 Hodge duals and Levi-Civita tensors Let $M$ be a Riemannian manifold of
dimension $n$, and let

$$ \alpha = \frac{1}{k!}\, a_{i_1 \dots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k} \tag{8.141} $$

be a $k$-form expressed in local coordinates. Verify that the Hodge dual of $\alpha$
can be written as

$$ \star\alpha = \frac{1}{(n - k)!}\, a^*_{i_{k+1} \dots i_n}\, dx^{i_{k+1}} \wedge \cdots \wedge dx^{i_n}, \tag{8.142} $$

where

$$ a^*_{i_{k+1} \dots i_n} := \frac{1}{k!}\, a^{i_1 \dots i_k}\, \epsilon_{i_1 \dots i_k i_{k+1} \dots i_n}. \tag{8.143} $$

Here $\epsilon_{i_1 \dots i_n}$ is the Levi-Civita alternating tensor, and indices are raised and
lowered by the metric. Remark: If you prefer, you can rewrite these
formulae without the factorials by using the parenthetical notation introduced in
Chapter 2. In that case you would write

$$ \alpha = a_{(i_1 \dots i_k)}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k}, \tag{8.144} $$

$$ \star\alpha = a^*_{(i_{k+1} \dots i_n)}\, dx^{i_{k+1}} \wedge \cdots \wedge dx^{i_n}, \tag{8.145} $$

and

$$ a^*_{i_{k+1} \dots i_n} := a^{(i_1 \dots i_k)}\, \epsilon_{i_1 \dots i_k i_{k+1} \dots i_n}, \tag{8.146} $$

respectively, where $(i_1, \dots, i_k)$ denotes the ordered index collection $i_1 \dots i_k$
if $i_1 < \cdots < i_k$ and zero otherwise.

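8.55 More Hodge duals Let $\alpha$ and $\beta$ be forms. In local coordinates we have

$$ \alpha = \frac{1}{k!}\, a_{i_1 \dots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k} $$

and

$$ \beta = \frac{1}{k!}\, b_{i_1 \dots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k}. $$

Show that

$$ \alpha \wedge \star\beta = \frac{1}{k!}\, a^{i_1 \dots i_k}\, b_{i_1 \dots i_k}\, \sigma, \tag{8.147} $$

where $\sigma$ is the volume form. This can also be written, using the parenthetical
index notation of Chapter 2, as

$$ \alpha \wedge \star\beta = a^{(i_1 \dots i_k)}\, b_{(i_1 \dots i_k)}\, \sigma. \tag{8.148} $$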

8.56 div, grad, curl, and all that Let $f$ be a smooth function and $X = X^i \partial_i$
a smooth vector field on a Riemannian manifold. By analogy with their
flat-space counterparts, we define the divergence, curl, and gradient as follows:

$$ \operatorname{div} X = \nabla_i X^i, \tag{8.149} $$

$$ \operatorname{curl} X = \epsilon^{ijk} (\nabla_i X_j)\, \partial_k, \tag{8.150} $$

$$ \operatorname{grad} f = g^{ij} (\nabla_i f)\, \partial_j = (\nabla^j f)\, \partial_j, \tag{8.151} $$

where $\nabla$ is the Levi-Civita connection. The Laplacian or Laplace-Beltrami
operator is

$$ \operatorname{div} \circ \operatorname{grad} f = g^{ij} \nabla_i \nabla_j f = \nabla^i \nabla_i f = \nabla_i \nabla^i f = \nabla^2 f. \tag{8.152} $$

The divergence, gradient, and Laplacian are defined in any dimension but the
curl is defined only in three dimensions. 43

(a) Set $G := \det g_{ij}$. Show that we can write

$$ \operatorname{div} X = G^{-1/2} (G^{1/2} X^j)_{,j} \tag{8.153} $$

so that

$$ \nabla^2 f = G^{-1/2} (G^{1/2} g^{ij} f_{,i})_{,j}. \tag{8.154} $$

Hint: Show that $(G^{1/2})_{,j} = G^{1/2}\, \Gamma^k{}_{kj}$. You may find Exercise 1.36
helpful. Don't forget the chain rule.

(b) Show that

$$ \mathcal{L}_X \sigma = (\operatorname{div} X)\, \sigma, \tag{8.155} $$

where $\sigma$ is the canonical volume form. Hint: You may wish to use the fact
that $\mathcal{L}_X$ is a derivation and that it commutes with the exterior derivative on
forms. Remark: Equation (8.155) is sometimes taken to be the definition
of the divergence.

43 In fact the cross product can only be defined in three dimensions. It is perhaps worth observing that
these definitions are easy to state in terms of components but awkward in the extreme to formulate
otherwise. For example, to define the curl of the vector field $X$ you must first introduce the 1-form
$\tilde{X} = g(\cdot, X)$, then take $d\tilde{X}$ to get a 2-form, then take the Hodge dual to get another 1-form $\star d\tilde{X}$,
and finally turn that back into a vector field using the inverse metric to get the vector field $g^{-1}(\cdot, \star d\tilde{X})$.
Believe it or not, this is the curl of $X$. Score one for the index crowd.

(c) In flat three-dimensional space, the divergence of a curl and the curl of
a gradient vanish. Show that the same thing is true in a curved
three-dimensional space.

8.57 div, curl, grad, and Laplacian in spherical polar coordinates Spherical 
polar coordinates (r, 0, </>) on M 3 are related to Cartesian coordinates by 


x = r sin 6 cos ( 


y — r sin 9 sin tp , 


and 


z — r cos 9. 


(a) Show that the Euclidean metric tensor components in spherical polar 
coordinates are given by 



0 

0 


0 0 Usin 0, 


(8.156) 


(b) Verify that 


Cy . 


dr 


1 9 

e* := , and 

6 rd9 


1 


:= 


(8.157) 


r sin 9 dtp 

constitute an orthonormal frame field. 

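(c) Using (8.153), (8.150), (8.151), and (8.154), show that, in spherical polar
coordinates,

$$\operatorname{div} X = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2 X^{\hat r}\right) + \frac{1}{r \sin\theta}\frac{\partial}{\partial\theta}\left(X^{\hat\theta} \sin\theta\right) + \frac{1}{r \sin\theta}\frac{\partial X^{\hat\phi}}{\partial\phi},$$

$$\operatorname{curl} X = \frac{1}{r \sin\theta}\left[\frac{\partial}{\partial\theta}\left(\sin\theta\, X^{\hat\phi}\right) - \frac{\partial X^{\hat\theta}}{\partial\phi}\right] e_{\hat r} + \frac{1}{r}\left[\frac{1}{\sin\theta}\frac{\partial X^{\hat r}}{\partial\phi} - \frac{\partial}{\partial r}\left(r X^{\hat\phi}\right)\right] e_{\hat\theta} + \frac{1}{r}\left[\frac{\partial}{\partial r}\left(r X^{\hat\theta}\right) - \frac{\partial X^{\hat r}}{\partial\theta}\right] e_{\hat\phi},$$

$$\operatorname{grad} f = \frac{\partial f}{\partial r}\, e_{\hat r} + \frac{1}{r}\frac{\partial f}{\partial\theta}\, e_{\hat\theta} + \frac{1}{r \sin\theta}\frac{\partial f}{\partial\phi}\, e_{\hat\phi},$$

and

$$\nabla^2 f = \frac{1}{r^2}\frac{\partial}{\partial r}\left(r^2 \frac{\partial f}{\partial r}\right) + \frac{1}{r^2 \sin\theta}\frac{\partial}{\partial\theta}\left(\sin\theta\, \frac{\partial f}{\partial\theta}\right) + \frac{1}{r^2 \sin^2\theta}\frac{\partial^2 f}{\partial\phi^2}.$$

Watch out! The components of $X$ are defined relative to the orthonormal
frame field by the expansion $X = X^{\hat r} e_{\hat r} + X^{\hat\theta} e_{\hat\theta} + X^{\hat\phi} e_{\hat\phi}$, but all the
formulae assume a coordinate basis.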
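
Part (a) of this exercise can be sanity-checked numerically before attempting the algebra. The following is only a sketch, not part of the exercise: it approximates the Jacobian of the map $(r, \theta, \phi) \mapsto (x, y, z)$ by central differences and forms $g_{ij} = \sum_k (\partial x^k/\partial q^i)(\partial x^k/\partial q^j)$. The function names `cartesian` and `metric`, the step size `h`, and the sample point are illustrative choices.

```python
import math

def cartesian(r, theta, phi):
    """Cartesian coordinates of the point with spherical polar coordinates (r, theta, phi)."""
    return (r * math.sin(theta) * math.cos(phi),
            r * math.sin(theta) * math.sin(phi),
            r * math.cos(theta))

def metric(r, theta, phi, h=1e-6):
    """Approximate g_ij = sum_k (dx^k/dq^i)(dx^k/dq^j) using central differences."""
    q = [r, theta, phi]
    columns = []  # columns[a][k] approximates dx^k / dq^a
    for a in range(3):
        plus, minus = list(q), list(q)
        plus[a] += h
        minus[a] -= h
        xp, xm = cartesian(*plus), cartesian(*minus)
        columns.append([(p - m) / (2 * h) for p, m in zip(xp, xm)])
    return [[sum(columns[a][k] * columns[b][k] for k in range(3))
             for b in range(3)] for a in range(3)]

# Sample point (an arbitrary choice away from the coordinate singularities).
r0, theta0, phi0 = 2.0, 0.7, 1.3
g = metric(r0, theta0, phi0)
expected = [[1.0, 0.0, 0.0],
            [0.0, r0 ** 2, 0.0],
            [0.0, 0.0, (r0 * math.sin(theta0)) ** 2]]
```

Comparing `g` with `expected` entry by entry reproduces the diagonal form (8.156) up to discretization error; it is a check at one point, not a proof.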
8.58 The Weitzenböck formula (Warning: This problem may cause index
blindness and headaches.) Let $\alpha$ be a $k$-form given by

$$\alpha = \frac{1}{k!}\, a_{i_1 \dots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k}$$

in local coordinates.

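(a) Show that

$$\begin{aligned}
d\alpha &= \frac{1}{k!}\, \partial_{i_1} a_{i_2 \dots i_{k+1}}\, dx^{i_1} \wedge \cdots \wedge dx^{i_{k+1}} && (8.158)\\
&= \frac{1}{k!}\, \nabla_{[i_1} a_{i_2 \dots i_{k+1}]}\, dx^{i_1} \wedge \cdots \wedge dx^{i_{k+1}} && (8.159)\\
&= \frac{1}{(k+1)!} \sum_{r=1}^{k+1} (-1)^{r-1}\, \nabla_{i_r} a_{i_1 \dots \widehat{i_r} \dots i_{k+1}}\, dx^{i_1} \wedge \cdots \wedge dx^{i_{k+1}}, && (8.160)
\end{aligned}$$

where we are using the shorthand notation introduced in Exercise 8.31,
in which $\nabla_j a_{i_1 \dots i_k} = a_{i_1 \dots i_k ; j}$, and the bracket in (8.159) means that you
are to totally antisymmetrize the indices. Remark: The point is that we
can replace the ordinary derivatives appearing in the exterior derivative
by covariant derivatives, because the components $a_{i_1 \dots i_k}$ are totally
antisymmetric and the Christoffel symbols are symmetric.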

(b) Similarly, show that, if $\delta$ is the co-differential operator,

$$\begin{aligned}
\delta\alpha &= -\frac{1}{(k-1)!}\, \nabla^{i_1} a_{i_1 i_2 \dots i_k}\, dx^{i_2} \wedge \cdots \wedge dx^{i_k} && (8.161)\\
&= -\frac{1}{(k-1)!}\, g^{i_1 j}\, \nabla_j a_{i_1 i_2 \dots i_k}\, dx^{i_2} \wedge \cdots \wedge dx^{i_k}.
\end{aligned}$$

Hint: You will probably want to use the Leibniz rule and metric compatibility.
At one point you may want to use the fact (see Exercise 8.52(h))
that the covariant derivative of the Levi-Civita tensor vanishes. Remark:
This exercise shows that if $X = X^i \partial_i$ is a vector field and $\tilde{X} = X_i\, dx^i$
is the corresponding covector field then $\delta\tilde{X} = -\operatorname{div} X$, the divergence of
$X$. (See Exercise 8.56.)

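(c) Let $\Delta = d\delta + \delta d$ be the Laplace-de Rham operator. Writing

$$\Delta\alpha = \frac{1}{k!}\, (\Delta\alpha)_{i_1 \dots i_k}\, dx^{i_1} \wedge \cdots \wedge dx^{i_k},$$

prove the Weitzenböck formula,

$$(\Delta\alpha)_{i_1 \dots i_k} = -\nabla^2 a_{i_1 \dots i_k} + \sum_{r=1}^{k} a_{i_1 \dots i_{r-1}\, j\, i_{r+1} \dots i_k}\, R^{j}{}_{i_r} - \frac{1}{2} \sum_{\substack{r, s = 1 \\ r \neq s}}^{k} a_{i_1 \dots i_{r-1}\, j\, i_{r+1} \dots i_{s-1}\, l\, i_{s+1} \dots i_k}\, R^{jl}{}_{i_r i_s}, \tag{8.163}$$

where

$$R^{i}{}_{j} := g^{im} R_{mj} \qquad \text{and} \qquad R^{ij}{}_{kl} := g^{jm} R^{i}{}_{mkl} \tag{8.164}$$

are raised-index versions of the Ricci and Riemann tensors, respectively.
Hint: You may wish to use (8.109). Remark: This is the first in a long line
of Weitzenböck-type formulas.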

8.59 Bochner's theorem Again let $\alpha$ be a $k$-form ($0 < k < n$) with local
components $a_{i_1 \dots i_k}$. Assume that $M$ is a compact, connected, orientable
Riemannian manifold without boundary.

(a) Assuming that $\alpha$ is harmonic, show that

$$\int_M \left[ -(\nabla^2 a_{i_1 \dots i_k})\, a^{i_1 \dots i_k} + k F(\alpha) \right] \sigma = 0, \tag{8.165}$$

where

$$F(\alpha) := R_{ij}\, a^{i i_2 \dots i_k}\, a^{j}{}_{i_2 \dots i_k} - \frac{1}{2}(k - 1)\, R_{ijpq}\, a^{ij i_3 \dots i_k}\, a^{pq}{}_{i_3 \dots i_k}. \tag{8.166}$$

Hint: Consider $(\Delta\alpha, \alpha)$ and use (8.147) and (8.163).

(b) Prove the following integration by parts formula:

$$\int_M a^{i_1 \dots i_k}\, \nabla^2 a_{i_1 \dots i_k}\, \sigma = -\int_M (\nabla^j a^{i_1 \dots i_k})(\nabla_j a_{i_1 \dots i_k})\, \sigma. \tag{8.167}$$

Hint: Show that $\int_M \nabla^2 (a_{i_1 \dots i_k} a^{i_1 \dots i_k})\, \sigma = 0$ (*). Do this by applying the
Weitzenböck formula (8.163) to $\Delta(a_{i_1 \dots i_k} a^{i_1 \dots i_k})$ and using Theorem 8.10.
Now expand (*) using the Leibniz rule.

(c) Prove a celebrated theorem due to Bochner: If $M$ is a compact, connected,
orientable Riemannian manifold without boundary and with constant
positive curvature then $M$ is a homology sphere (i.e., all the Betti numbers
$b_k$ ($0 < k < n$) vanish). (We already know that $b_0 = 1$, because $M$ is
connected, and that $b_n = 1$, by Poincaré duality.) Hint: Combine (a) and
(b). Use (8.121) to show that $F(\alpha) = k!\,(n - k) K \|\alpha\|^2$, where $K$ is the
constant positive curvature and $\|\alpha\|^2 := \frac{1}{k!}\, a_{i_1 \dots i_k} a^{i_1 \dots i_k}$. Don't forget
Theorem 8.12 of Hodge.
8.60 Areas of spheres and volumes of balls. Let

$$\omega = dx^1 \wedge \cdots \wedge dx^{n+1}$$

be the standard volume element on $\mathbb{R}^{n+1}$, and define $r^2 = \sum_i (x^i)^2$. Denote
by $V_{n+1}$ the volume of the unit ball $B^{n+1} = \{x \in \mathbb{R}^{n+1} : r^2 \le 1\}$, so that

$$V_{n+1} = \int_{B^{n+1}} \omega.$$

Also, let $A_n$ denote the $n$-dimensional "area" of the unit $n$-sphere $S^n =
\partial B^{n+1} = \{x \in \mathbb{R}^{n+1} : r^2 = 1\}$, so that

$$A_n = \int_{S^n} \sigma,$$

where $\sigma$ is the volume form on $S^n$. We seek an explicit expression for $\sigma$ in
Cartesian coordinates. One way to do this would be to compute $\sigma$ in multidimensional
spherical polar coordinates and then convert back. A more elegant
approach, which we follow here (with slightly different notation), is that
of [25], p. 74.

(a) Show that

$$V_{n+1} = \frac{A_n}{n+1}.$$

For example, $V_2 = \pi = A_1/2$ and $V_3 = 4\pi/3 = A_2/3$. Hint: By a
dimensional argument, the area of an $n$-sphere of radius $r$ is $r^n A_n$. The
volume of the ball $B^{n+1}$ may be obtained by integrating spherical shells
of thickness $dr$.

(b) A key fact is that $\sigma$ must be spherically symmetric (i.e., invariant under
rotations). To find such a form, we first differentiate both sides of

$$r^2 = (x^1)^2 + \cdots + (x^{n+1})^2$$

to get

$$r\, dr = x^1\, dx^1 + \cdots + x^{n+1}\, dx^{n+1}.$$

This is a spherically symmetric 1-form on $\mathbb{R}^{n+1}$. Its Hodge dual is the
$n$-form

$$\tau := \star\, r\, dr = \sum_{i=1}^{n+1} (-1)^{i+1}\, x^i\, dx^1 \wedge \cdots \wedge \widehat{dx^i} \wedge \cdots \wedge dx^{n+1}.$$

This, too, is spherically symmetric, so restricting it to the $n$-sphere must
yield a multiple of $\sigma$. Thus

$$\sigma = c\, \tau$$

for some constant $c$. 44 Show that $c = 1$. Hint: Compute $A_n$ using Stokes'
theorem.

(c) For fun, show that

$$V_n = \frac{2 \pi^{n/2}}{n\, \Gamma(n/2)},$$

where $\Gamma(x)$ is the gamma function. Hint: There are several ways to do
this. One pretty approach proceeds from Dirichlet's generalization of the
beta integral ([4], p. 32). Another method uses the trick of integrating $e^{-r^2}$
over the whole of $\mathbb{R}^n$ in both Cartesian and spherical polar coordinates
([46], p. 63). A third approach is to slice the $n$-ball into disks and use
recursion ([25], p. 74). A related technique is given in ([5], p. 411).
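
The closed form in part (c) is easy to evaluate, which gives a quick numerical check of the low-dimensional values quoted in part (a). The sketch below is an illustration only. The function names are hypothetical, and the area formula $A_n = 2\pi^{(n+1)/2}/\Gamma((n+1)/2)$ is the standard one, quoted here as an assumption rather than derived.

```python
import math

def ball_volume(n):
    """Volume V_n of the unit ball in R^n, from V_n = 2 pi^(n/2) / (n Gamma(n/2))."""
    return 2 * math.pi ** (n / 2) / (n * math.gamma(n / 2))

def sphere_area(n):
    """Area A_n of the unit n-sphere in R^(n+1) (standard formula, assumed here)."""
    return 2 * math.pi ** ((n + 1) / 2) / math.gamma((n + 1) / 2)

# Largest deviation from the relation V_{n+1} = A_n / (n + 1) over a few dimensions.
max_gap = max(abs(ball_volume(n + 1) - sphere_area(n) / (n + 1)) for n in range(1, 10))
```

With these definitions `ball_volume(2)` and `ball_volume(3)` reproduce $\pi$ and $4\pi/3$ to rounding error, and `max_gap` stays at rounding-error level.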

8.61 Spherical harmonics Let $f$ be a harmonic function on $\mathbb{R}^3$, so that $\Delta f =
-\nabla^2 f = 0$. (See Exercise 8.58.) Suppose that, in spherical polar coordinates,

$$f(r, \theta, \phi) = r^n h(\theta, \phi) \tag{8.168}$$

for some $n$ and some function $h$. Then $h$ is called a spherical harmonic (of
degree $n$).

(a) By treating $h$ as a function on the 2-sphere $S^2$, show that

$$\Delta h = n(n + 1) h, \tag{8.169}$$

where $\Delta$ is now the Laplace-de Rham operator on $S^2$. Hint: Use Exercise 8.56
together with the easy-to-prove fact that, acting on functions in
spherical polar coordinates, $\nabla^2$ on the 2-sphere is just the spherical part
of $\nabla^2$ on $\mathbb{R}^3$ with $r = 1$.

(b) Show that spherical harmonics of different degrees are orthogonal with
respect to the Hodge inner product.

8.62 Harmonic forms and the Hodge decomposition Fix a $k$-form $\lambda$. Show
that there exists a $k$-form $\omega$ such that

$$\Delta\omega = \lambda \tag{8.170}$$

if and only if $(\lambda, \eta) = 0$ for every harmonic form $\eta$. Hint: One direction is
easy. For the other, invoke the Hodge decomposition theorem, Theorem 8.11,
to show that $\lambda = d\alpha + \delta\beta$ for some $\alpha$ and $\beta$. Write $\alpha = d\alpha_1 + \delta\beta_1 + \gamma_1$, then
decompose again to show that $d\alpha$ is in the image of $\Delta$. A similar argument shows that
$\delta\beta$ is also in the image of $\Delta$.

44 Technically, $\sigma = c\, \iota^*\tau$ where $\iota : S^n \to \mathbb{R}^{n+1}$ is the inclusion map.


9 

The degree of a smooth map 


All differences in this world are of degree, and not of 
kind, because oneness is the secret of everything. 

Swami Vivekananda 

Suppose you are walking your dog Spot. You put the leash on Spot and take him 
to his favorite tree to do his business, but he sees a squirrel on the tree and takes 
off after it. The squirrel is really tired of being chased, and decides to teach Spot 
a lesson. So, instead of climbing back up the tree, he runs in a counterclockwise 
direction around the trunk with Spot not far behind. As Spot follows him around 
the tree, the leash gets wound around the tree k times (assuming you stay in place 
while he is running). At this point Spot gives up and sits down near your feet to 
bark sullenly at the squirrel. We say that the winding number of the leash is k. 
No matter how you try to move the leash, unless you cut it, it will remain wound 
around the tree k times. 1 That is, its winding number is a homotopy invariant. This 
prosaic example generalizes to higher dimensions and has interesting mathematical 
and physical applications. 2 

We start with the stack of records theorem, so-called because it reveals that all
smooth maps from a compact manifold to another manifold of the same dimension
look like a smooth covering by a stack of records. (See Figure 9.1.)

Theorem 9.1 (Stack of records theorem) Let $f : M \to N$ be a smooth map
between manifolds of the same dimension, with $M$ compact. If $q$ is a regular value
of $f$ then $f^{-1}(q)$ is a finite set of points $\{p_1, \dots, p_r\}$. Moreover, there exists a
neighborhood $V_q$ of $q$ in $N$ such that $f^{-1}(V_q)$ is a disjoint union of neighborhoods
of the $p_i$, each of which is mapped diffeomorphically onto $V_q$.

1 Again, we assume that both you and Spot remain fixed in place, and that you do not detach the leash from your 
wrist or Spot’s collar. 

2 In condensed matter physics, quantum mechanics, and quantum field theory, higher-dimensional winding 
numbers are usually called topological quantum numbers. For more on this see [66] or [67]. 
Figure 9.1 A stack of records in (a) one dimension (the domain manifold
is shown) and (b) higher dimensions (fragments of the domain manifold are
shown).

Proof Let $P := f^{-1}(q) \subset M$. By the definition of a regular value together with
the inverse function theorem, each point $p \in P$ has an open neighborhood $U'_p$ with
the property that $f$ restricted to $U'_p$ is a diffeomorphism. The collection $\mathcal{C} := \{U'_p\}_{p \in P}$
is an open cover of $P$. But $M$ is compact, so $P$ is compact (cf. Exercise 3.12).
Hence we may choose a finite subcover $\mathcal{C}' \subset \mathcal{C}$. Each $U'_p \in \mathcal{C}'$ can contain only
one preimage of $q$ (otherwise $f$ is not locally bijective), so the number of points
$p \in P$ is finite, say $p_1, \dots, p_r$. By taking smaller neighborhoods if necessary
we may also assume that the $U'_{p_i}$ are disjoint. Now define $V_i := f(U'_{p_i})$ and set
$V_q = \bigcap_i V_i$. Define $U_i := f^{-1}(V_q) \cap U'_{p_i}$. Then $f^{-1}(V_q) = \bigsqcup_i U_i$ (a disjoint
union). By construction, $f$ restricted to each $U_i$ is a diffeomorphism. □

Now let $f : M \to N$ be a smooth map between compact, connected, oriented
manifolds without boundary and of the same dimension. Let $q$ be a regular value of
$f$, and let $f^{-1}(q) = \{p_1, \dots, p_r\}$. Define the index $I(f; p_i)$ of the point $p_i$ to be
the sign of the Jacobian determinant of $f$ at $p_i$. (This sign is well defined because
both manifolds have a fixed orientation, so $f$ is either orientation preserving, with
sign $+1$, or orientation reversing, with sign $-1$.) The degree of $f$ is the sum of the
indices of all the points in $f^{-1}(q)$:

$$\deg f := \sum_{i=1}^{r} I(f; p_i). \tag{9.1}$$


As the notation suggests, the degree does not depend on the regular value chosen
for its definition. Moreover, it is a homotopy invariant, as $f \simeq g$ implies $\deg f =
\deg g$. We will prove both these facts by developing an alternate definition of the
degree of a map.
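
For maps of the circle, definition (9.1) can be tried out directly. The sketch below is an illustration under stated assumptions: a circle map is described by a lift `lift` to the real line (so that `lift(theta + 2*pi) == lift(theta) + 2*pi*k`), the preimages of a regular value `q` are located by sampling, and each is given the sign of the derivative there. The function name `signed_preimages` and the sample map `phi` are hypothetical choices.

```python
import math

def signed_preimages(lift, q, samples=20000):
    """Signs of d(lift)/d(theta) at the preimages of the angle q, in order of theta.

    A preimage is a theta in [0, 2*pi) with lift(theta) = q + 2*pi*m for an
    integer m. It shows up as a jump in floor((lift - q) / (2*pi)) between
    consecutive samples: an upward jump means a positive derivative (+1),
    a downward jump a negative one (-1).
    """
    two_pi = 2 * math.pi
    signs = []
    prev = math.floor((lift(0.0) - q) / two_pi)
    for i in range(1, samples + 1):
        cur = math.floor((lift(two_pi * i / samples) - q) / two_pi)
        if cur != prev:
            signs.append(1 if cur > prev else -1)
        prev = cur
    return signs

def phi(theta):
    """Lift of a degree-one circle map that is not monotonic (illustrative choice)."""
    return theta + 2.0 * math.sin(theta)

signs_a = signed_preimages(phi, 1.0)  # a regular value with a single preimage
signs_b = signed_preimages(phi, 3.0)  # a regular value with three preimages
```

The two regular values have different numbers of preimages, but the signed counts `sum(signs_a)` and `sum(signs_b)` agree, as the independence claim above predicts.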

By Poincaré duality and the connectivity of $M$ we have $H^n(M) \cong H^0(M) \cong \mathbb{R}$,
where the isomorphism $H^n(M) \cong \mathbb{R}$ is given by integration over $M$. As $f^* :
H^n(N) \to H^n(M)$ is a linear map of vector spaces we get the following
commutative diagram:

$$\begin{array}{ccc}
H^n(N) & \xrightarrow{\;f^*\;} & H^n(M) \\
\Big\downarrow{\scriptstyle \int_N} & & \Big\downarrow{\scriptstyle \int_M} \\
\mathbb{R} & \xrightarrow{\;\operatorname{Deg} f\;} & \mathbb{R}
\end{array} \tag{9.2}$$

The lower map is a linear map between one-dimensional vector spaces
and is therefore simply multiplication by an element $\operatorname{Deg} f \in \mathbb{R}$ that depends only
on $f$. Many authors define this to be the degree of $f$, which is justified by the
following fact.


Theorem 9.2 Let $f$, $M$, and $N$ be as above. Then

$$\operatorname{Deg} f = \deg f.$$


Proof We apply the stack of records theorem (and use the same notation). Assume
$q$ is a regular value of $f$. Let $\omega \in \Omega^n(N)$ be an $n$-form whose support is contained
in $V_q$. Define $f_i := f|_{U_i}$. Then, as each $f_i$ is a diffeomorphism, the change of
variables theorem gives

$$\int_M f^*\omega = \sum_{i=1}^{r} \int_{U_i} f_i^*\omega = \sum_{i=1}^{r} I(f; p_i) \int_{V_q} \omega = (\deg f) \int_N \omega.$$

□


It follows that $\deg f$ does not depend on the regular value $q$ used for its definition,
as the definition of $\operatorname{Deg} f$ makes no mention of regular values. Moreover,
homotopic maps induce the same map in cohomology, so $\operatorname{Deg} f$ is a homotopy
invariant. Thus, $\deg f$ enjoys this attribute as well. But Theorem 9.2 is a two-way
street, for if one were to define the degree of $f$ to be $\operatorname{Deg} f$ then there would
be no a priori reason for it to be an integer. Yet it is, because $\deg f$ is always an
integer.

EXERCISE 9.1 Given $f : M \to N$ and $g : N \to P$, show that the degree is
multiplicative:

$$\operatorname{Deg}(g \circ f) = (\operatorname{Deg} g)(\operatorname{Deg} f).$$

EXERCISE 9.2 Let $f : S^2 \to T^2$ be a smooth map from the 2-sphere to the 2-torus.
Show that $\deg f = 0$. Hint: The form $dx \wedge dy$ on the torus is a generator of its
top-dimensional cohomology. What happens when you pull it back and integrate it
over the 2-sphere?


EXERCISE 9.3 Let $M$ be a compact, connected, oriented, smooth manifold without
boundary. Fix a point $x \in \mathbb{R}^{n+1}$, and let $f : M \to \mathbb{R}^{n+1} - \{x\}$ be a smooth map.
Define a smooth map $g : M \to S^n \subset \mathbb{R}^{n+1}$ by

$$g(p) = \frac{f(p) - x}{\|f(p) - x\|}.$$

By definition, the winding number of $f$ (about $x$) is the degree of $g$. Show that,
when $M = S^1$ and $n = 1$, the winding number of $f$ is precisely the winding number
we encountered when walking Spot. That is, show that $\deg g$ counts the number of
times $f$ wraps the circle around the point $x$. Hint: There are many ways to do this but
perhaps the quickest is just to observe that, by homotopy invariance, we may assume
that $f$ is given by the map $\theta \mapsto \phi = k\theta$. Choose the closed but not exact 1-form
$d\phi$ on the image circle and pull it back to the domain circle. Otherwise, you have
to show that the inverse image of a regular point contains $k$ points and that all the
Jacobians are of the same sign.


The above exercise shows that the degree of a map is a generalization of the notion
of a winding number to higher dimensions. Intuitively, it counts the number of
times $M$ is wrapped around $S^n$ by $f$.
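
In the planar case the winding number can also be estimated numerically by integrating the pulled-back angle form, $\frac{1}{2\pi}\oint (u\, dv - v\, du)/(u^2 + v^2)$ with $(u, v) = f - x$. The sketch below is one way to do this, assuming the closed curve is parametrized on $[0, 2\pi]$. The function name `winding_number`, the midpoint discretization, and the test curve are illustrative choices, not taken from the text.

```python
import math

def winding_number(curve, point, samples=2000):
    """Estimate (1/2pi) * integral of (u dv - v du)/(u^2 + v^2) along a closed curve.

    `curve` maps t in [0, 2*pi] to a point (x, y) of the plane and must avoid `point`.
    Each small segment contributes the angle form evaluated at the segment midpoint.
    """
    px, py = point
    two_pi = 2 * math.pi
    x0, y0 = curve(0.0)
    u0, v0 = x0 - px, y0 - py
    total = 0.0
    for i in range(1, samples + 1):
        x1, y1 = curve(two_pi * i / samples)
        u1, v1 = x1 - px, y1 - py
        um, vm = 0.5 * (u0 + u1), 0.5 * (v0 + v1)
        total += (um * (v1 - v0) - vm * (u1 - u0)) / (um * um + vm * vm)
        u0, v0 = u1, v1
    return total / two_pi

def triple_circle(t):
    """The unit circle traversed three times counterclockwise."""
    return (math.cos(3 * t), math.sin(3 * t))

inside = winding_number(triple_circle, (0.2, 0.1))
outside = winding_number(triple_circle, (2.0, 0.0))
```

The estimate is close to 3 for the point enclosed by the curve and close to 0 for the point outside it; rounding gives the integer winding number.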


9.1 The hairy ball theorem and the Hopf fibration 

Of particular interest is the degree of a map from one sphere to another. It is immediate
from the definitions that the identity map $\mathrm{id} : S^n \to S^n$ has degree equal
to $+1$. It is also easy to see that the antipodal map $-\mathrm{id} : S^n \to S^n$ that sends
$x \mapsto -x$ (viewing $S^n$ as the unit sphere in $\mathbb{R}^{n+1}$) has degree equal to $(-1)^{n+1}$.

EXERCISE 9.4 Prove that the antipodal map has degree $(-1)^{n+1}$. Hint: See
Exercise 8.60.

We can use these facts to prove a beautiful little theorem with an ugly name: the 
hairy ball theorem. This theorem says that only odd-dimensional spheres have 
nowhere-vanishing tangent vector fields. The reason why people call it the hairy 
ball theorem is that the theorem says that you cannot comb the hair (really, fuzz) 
on a tennis ball without leaving a cowlick somewhere. 4 Alternatively, and much 
more interestingly, the theorem implies that there is always at least one point on 
the Earth where the wind is not blowing. 

Theorem 9.3 (The hairy ball theorem) The sphere $S^n$ admits a nowhere-zero
tangent vector field if and only if $n$ is odd.

Proof Identify $x \in S^n$ with the vector from the origin to $x$ in $\mathbb{R}^{n+1}$. Define the
unit normal vector $N = x/\|x\|$. If $n = 2k - 1$ is odd then the vector field $X =
(x^2, -x^1, x^4, -x^3, \dots, x^{2k}, -x^{2k-1}) \in \mathbb{R}^{2k}$ is everywhere orthogonal to $N$, so it is
everywhere tangent to the sphere and nonvanishing.

Conversely, let $n$ be even and suppose $X$ is a nowhere-zero tangent vector field
on the sphere. Then there exists a nowhere-zero tangent vector field $Y := X/\|X\|$
on the sphere of unit length. Consider the map

$$h : S^n \times [0, 1] \to \mathbb{R}^{n+1}$$

given by

$$h(x, t) = \cos(\pi t)\, N + \sin(\pi t)\, Y.$$

For fixed $t$, $h$ maps the sphere to itself (because $\|h(x, t)\| = 1$). Also, $h(x, t) = x$
for $t = 0$ and $h(x, t) = -x$ for $t = 1$, so $h$ is a homotopy between $\mathrm{id}$ and $-\mathrm{id}$.
Hence $1 = \deg(\mathrm{id}) = \deg(-\mathrm{id}) = (-1)^{n+1} = -1$, a contradiction. □
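
The explicit field used in the first half of the proof is simple enough to test numerically. The sketch below is an illustration only: it samples random points of $S^3$ and $S^5$ and checks that the field built by pairing coordinates is orthogonal to the position vector and has unit length there. The helper names and the random seed are arbitrary choices.

```python
import math
import random

def tangent_field(x):
    """The field X = (x^2, -x^1, x^4, -x^3, ...) at a point x of R^(2k)."""
    out = []
    for j in range(0, len(x), 2):
        out.extend([x[j + 1], -x[j]])
    return out

def random_unit_vector(dim, rng):
    """A random point of the unit sphere in R^dim (normalized Gaussian sample)."""
    v = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    norm = math.sqrt(sum(c * c for c in v))
    return [c / norm for c in v]

rng = random.Random(0)
max_dot = 0.0       # largest |<x, X(x)>| seen; should vanish up to rounding
max_norm_gap = 0.0  # largest deviation of |X(x)| from 1; should also vanish
for dim in (4, 6):  # ambient dimensions for S^3 and S^5
    for _ in range(200):
        x = random_unit_vector(dim, rng)
        X = tangent_field(x)
        dot = sum(a * b for a, b in zip(x, X))
        norm = math.sqrt(sum(c * c for c in X))
        max_dot = max(max_dot, abs(dot))
        max_norm_gap = max(max_norm_gap, abs(norm - 1.0))
```

The pairing trick needs an even number of ambient coordinates, which is exactly why it is available only when $n$ is odd.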


EXERCISE 9.5 Show that, if $f : S^n \to S^n$ has no fixed points, $\deg f = (-1)^{n+1}$.
Hint: Define $F : I \times S^n \to S^n$ by

$$F(t, x) = \frac{(1 - t) f(x) + t(-x)}{\|(1 - t) f(x) + t(-x)\|}.$$

Show that $F(t, x)$ is a well-defined smooth homotopy between $f$ and the antipodal
map.

3 Many Europeans prefer to call it the hedgehog theorem, which is much more civilized.

4 Note that you can comb the hair on your head without leaving a cowlick, because your head (or at least, the
place where you have long hair) is more like a hemisphere, so the hair can all be directed to one side.

EXERCISE 9.6 Suppose that $f : S^n \to S^n$ fails to be surjective. Show that $\deg f =
0$. Hint: See Exercise 4.12.

EXERCISE 9.7 Use the notion of the degree of a map to prove Brouwer's fixed
point theorem (cf. Example 4.6). Hint: Let $N = \{x \in S^n \subset \mathbb{R}^{n+1} : x^{n+1} \ge 0\}$ be the
northern hemisphere of the $n$-sphere, and define the southern hemisphere $S$ similarly.
Let $\varphi_N : N \to B^n$ and $\varphi_S : S \to B^n$ be diffeomorphisms that agree on $N \cap S$. Let
$f : B^n \to B^n$ be a smooth map without fixed points, and define a new (smooth) map
$g : S^n \to S^n$ by

$$g(x) = \begin{cases} \varphi_S^{-1} \circ f \circ \varphi_N(x), & \text{if } x \in N, \text{ and} \\ \varphi_S^{-1} \circ f \circ \varphi_S(x), & \text{if } x \in S. \end{cases}$$

Look at possible fixed points of $g$ and use Exercises 9.5 and 9.6.

So far we have discussed maps between spheres of the same dimension and have 
discovered that the degree of such a map is always an integer. Put another way, 
all the maps from one sphere to another of the same dimension fall into homotopy 
classes labeled by the integers. 

Can we say anything about maps from $S^m$ to $S^n$ when $m \neq n$? The answer
turns out to be easy when $m < n$. To see this, consider a map from a circle into a
2-sphere. The 2-sphere is simply connected, so we can always shrink the image of 
the circle to a point while remaining on the 2-sphere. In other words, every such 
map is null homotopic. This kind of behavior occurs again in higher dimensions, 
because any map from a lower-dimensional sphere to a higher dimensional sphere 
can be deformed to a map that misses at least one point, so by Exercise 4.12 it must 
be homotopically trivial. 

The case $m > n$ is quite subtle, and is still not entirely understood despite much
work. One of the most striking things is really that such maps exist at all. 5 The
classic example in this area is the Hopf fibration, which is a certain smooth map
$h : S^3 \to S^2$ whose precise definition would take us too far afield. It is called a
"fibration" because it expresses the 3-sphere as a smooth collection of fibers (the
sets $h^{-1}(x)$), all of which are copies of $S^1$. It turns out that the Hopf fibration has
degree 1, so there is at least one continuous map from $S^3$ to $S^2$ that cannot be
deformed to a point, a highly nonintuitive result! 6

5 Of course, there exist all sorts of maps from $S^m$ to $S^n$ with $m > n$. The surprising fact is that some of them are
continuous and even smooth.

6 For a discussion of the Hopf map and a computation of its degree, see e.g. [12], Ex. 17.23.
Figure 9.2 Some knots. (a) The unknot. (b) The trefoil. (c) The torus knot (3, 4).


9.2 Linking numbers and magnetostatics 

The degree of a map has many other interesting uses. Here’s one that is related 
to knot theory, a fascinating branch of mathematics that studies the topological 
properties of simple closed curves embedded in three-dimensional manifolds. 

For our purposes, we define a knot to be a circle embedded in $\mathbb{R}^3$ and a link to
be a bunch of disjoint knots (called the components of the link). The unknot is the
simplest knot, namely just the circle, and the unlink is a bunch of disjoint circles
in the plane. (See Figures 9.2 and 9.3.)

Intuitively we can think of a knot as a closed string, and two knots are the same
if we can deform one until it looks just like the other (without cutting the string, of
course). More precisely, two knots $K_1$ and $K_2$ are isotopic if there exists a smooth
map $f : \mathbb{R}^3 \times [0, 1] \to \mathbb{R}^3$ such that $f_t := f(\cdot, t) : \mathbb{R}^3 \to \mathbb{R}^3$ is a diffeomorphism
for each $t$, satisfying $f_0 = \mathrm{id}$ and $f_1(K_1) = K_2$. One of the primary objectives of knot theory
is to find good tools to distinguish which knots and links are trivial (isotopic to the
unknot or the unlink) and which are not.

Given a link, its linking number is defined as follows. The diagram of a link 
is just a generic plane projection of the link, with under and overcrossings denoted 
in the usual way. (See Figure 9.4.) Orient each component of the link arbitrarily. 
Associate a sign +1 to a right-handed crossing and a sign —1 to a left-handed 
crossing, as shown in Figure 9.5. Then the linking number of the link is one-half the 
sum of the signs of all the crossings of different components in any diagram of the 
oriented link. (Self-crossings do not count.) Figure 9.6 provides an example. Using 
something called “Reidemeister moves” one can show that the linking number is 
indeed invariant under an isotopy. Unfortunately, the linking number is not really 


7 Nice introductions to knot theory are provided by [2] and [52]. Connections between knot theory and physics
are discussed in [7] and [45].
Figure 9.3 Some links. (a) An unlink with two components. (b) A link with two
components. (c) The Borromean rings.



Figure 9.4 A diagram of the trefoil. 



Figure 9.5 (a) A right-handed crossing with sign $+1$. (b) A left-handed crossing
with sign $-1$.
Figure 9.6 The linking number of this link is $-1$.



Figure 9.7 The linking number of the Whitehead link is zero. 


Figure 9.8 A cross section of a solenoid with $m$ turns carrying a current out of
the paper at the top, and into the paper at the bottom. (The loop $C$ threads the solenoid.)


that useful, as one can see from the example of the Whitehead link, illustrated 
in Figure 9.7. It has linking number zero, but it is evidently not isotopic to the 
unlink. 

Actually the idea of a linking number is far older than this modern definition. 
Indeed, it was Gauss who first introduced the notion of linking while pondering the 
magnetic interaction of two current-carrying wires. Consider the solenoid depicted 
in Figure 9.8. Assuming a current I in the solenoid, on the one hand Ampere’s law 
Figure 9.9 A current-carrying wire loop $C'$ with current $I$.


tells us that the circulation of the magnetic field around $C$ is given by

$$\oint_C \mathbf{B} \cdot d\mathbf{l} = m \mu_0 I,$$

where $m$ is the number of windings in the solenoid.

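On the other hand, consider a wire loop $C'$ carrying a current $I$, as in Figure 9.9.
The magnetic field at a point $\mathbf{r}$ produced by this loop is, according to the law of
Biot and Savart,

$$\mathbf{B}(\mathbf{r}) = \frac{\mu_0 I}{4\pi} \oint_{C'} \frac{d\mathbf{l}' \times (\mathbf{r} - \mathbf{r}')}{|\mathbf{r} - \mathbf{r}'|^3},$$

where $\mathbf{r}'$ points from the origin to a current element on $C'$ and $d\mathbf{l}'$ is the
infinitesimal displacement vector along $C'$ in the direction of the current.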

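Now let $C'$ be the solenoid with $m$ turns considered previously. Combining
Ampère's law with the expression obtained from Biot and Savart and using the
triple product identity yields Gauss's formula for the linking number of the two
curves $C$ and $C'$:

$$\operatorname{link}(C, C') = \frac{1}{4\pi} \oint_C \oint_{C'} \frac{\mathbf{r} - \mathbf{r}'}{|\mathbf{r} - \mathbf{r}'|^3} \cdot (d\mathbf{l} \times d\mathbf{l}'). \tag{9.3}$$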


Although it would be difficult (if not impossible) actually to compute the right- 
hand side of Gauss’s formula in any particular case, it does have the virtue of being 
manifestly symmetric under the interchange of C and C' . 
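
A closed-form evaluation of (9.3) is rarely practical, but a numerical quadrature is straightforward for simple curves. The sketch below is an illustration only: each curve is supplied as a function returning a point and its tangent vector on $[0, 2\pi)$, and the double integral is approximated by a periodic Riemann sum. The function names, the choice of two unit circles, and the sample count are assumptions made for the example. The overall sign depends on the orientations chosen, so only the magnitude is checked.

```python
import math

def gauss_linking(curve_a, curve_b, samples=200):
    """Approximate (1/4pi) * double integral of (r - r') . (dl x dl') / |r - r'|^3.

    Each curve maps t in [0, 2*pi) to a pair (point, tangent), both 3-tuples.
    """
    h = 2 * math.pi / samples
    pts_a = [curve_a(h * i) for i in range(samples)]
    pts_b = [curve_b(h * j) for j in range(samples)]
    total = 0.0
    for r, dr in pts_a:
        for s, ds in pts_b:
            d = (r[0] - s[0], r[1] - s[1], r[2] - s[2])
            cross = (dr[1] * ds[2] - dr[2] * ds[1],
                     dr[2] * ds[0] - dr[0] * ds[2],
                     dr[0] * ds[1] - dr[1] * ds[0])
            dist3 = (d[0] ** 2 + d[1] ** 2 + d[2] ** 2) ** 1.5
            total += (d[0] * cross[0] + d[1] * cross[1] + d[2] * cross[2]) / dist3
    return total * h * h / (4 * math.pi)

def circle_xy(t):
    """Unit circle in the xy-plane centered at the origin, with its tangent."""
    return (math.cos(t), math.sin(t), 0.0), (-math.sin(t), math.cos(t), 0.0)

def circle_xz(center_x):
    """Unit circle in the xz-plane centered at (center_x, 0, 0), with its tangent."""
    def curve(t):
        return ((center_x + math.cos(t), 0.0, math.sin(t)),
                (-math.sin(t), 0.0, math.cos(t)))
    return curve

linked = gauss_linking(circle_xy, circle_xz(1.0))    # the two circles interlock
unlinked = gauss_linking(circle_xy, circle_xz(5.0))  # the two circles are far apart
```

For the interlocked pair the result has absolute value close to 1, and for the separated pair it is close to 0.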

Gauss's expression was generalized to higher dimensions by Călugăreanu and
White (see e.g. [61]). Let $M$ and $N$ be two oriented compact boundaryless
submanifolds of $\mathbb{R}^n$ with $\dim M + \dim N = n - 1$. Then their linking number,
$\operatorname{link}(M, N)$, is defined to be the degree of the map $f : M \times N \to \mathbb{R}^n - \{0\}$ given by

$$f(x, y) = y - x.$$
Figure 9.10 Some vector fields in the plane (actually, their integral curves): (a) a
source, (b) a sink, (c) a circulation, (d) a saddle, (e) a spiral, and (f) a dipole.

One can show ([54], Theorem 11.14) that, for the case $M = N = S^1$, this definition
reduces to Gauss's definition and coincides with the knot-theoretic definition of the
linking number given above. As the degree is a homotopy invariant, this provides
another proof that the linking number is invariant under isotopy.

9.3 The Poincare-Hopf index theorem and the Gauss-Bonnet theorem 

We cannot abandon the subject of mapping degrees without at least mentioning 
two of the most important results of differential topology. To set the stage, let us 
consider a few vector fields X in the plane, as shown in Figure 9.10. The first 
observation is that all the interesting behavior occurs near a zero (or singularity) 
$p$, where $X_p = 0$, because away from a zero the smoothness of the vector field
ensures that X does not change much in magnitude or direction. 

The notion of the degree of a map allows us to construct a measure of the behav- 
ior of a vector field near a zero. Although the definition appears a bit contrived at 
first glance, it is remarkably powerful. A zero p of X is isolated if it is possible 
to find a neighborhood U of p that contains no other zeros of X. Henceforth we 
assume all our zeros are of this type. Then we may choose a closed neighborhood
$B$ of $p$ that contains no zeros of $X$ other than $p$ and a diffeomorphism $\varphi : B \to B^n$,
where $B^n \subset \mathbb{R}^n$ is the usual $n$-ball. Let $\pi : \mathbb{R}^n - \{0\} \to S^{n-1}$ denote the standard
projection map $x \mapsto x/\|x\|$, and define

$$\psi := \pi \circ \varphi_* \circ X \circ \varphi^{-1} : \partial B^n = S^{n-1} \to S^{n-1}.$$

In other words, let $x \in \partial B^n$. Then $q := \varphi^{-1}(x) \in \partial B$ and $X_q \neq 0$, so by linearity
$\varphi_* X_q \in \mathbb{R}^n - \{0\}$. Hence $\psi(x) \in S^{n-1}$. Now we define the index $\operatorname{ind}(X, p)$ of $X$
to be the degree of $\psi$. Although $\operatorname{ind}(X, p)$ appears to depend on $\varphi$, in fact it does
not (see e.g. [58], p. 34). 8

Example 9.1 Let $X$ be a vector field in $\mathbb{R}^2$. Pick a zero of $X$ and choose a small
circle $C_1$ surrounding it. Associate to each point $x$ of $C_1$ a point $y$ on a second
circle $C_2$, where $y = X(x)/\|X(x)\|$. Then as you walk around $C_1$, $y$ moves on $C_2$.
The degree $\deg \psi$ counts the number of times $y$ moves counterclockwise around $C_2$
minus the number of times $y$ moves clockwise around $C_2$. Applying this to the vector
fields in Figure 9.10 gives the following indices: (a) $+1$, (b) $+1$, (c) $+1$, (d) $-1$, (e)
$+1$, and (f) $+2$.
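
The recipe of Example 9.1 translates directly into a short computation: walk once around a small circle about the zero and add up the changes in the direction angle of $X$. The sketch below is an illustration under stated assumptions. The six formulas are model fields chosen to resemble the types named in the caption of Figure 9.10 (they are not taken from the figure), and the function name `index_at_origin` is hypothetical.

```python
import math

def index_at_origin(field, radius=0.5, samples=720):
    """Index of a planar vector field at an isolated zero at the origin.

    Accumulates the change in the direction angle of the field once
    counterclockwise around a circle of the given radius.
    """
    vx, vy = field(radius, 0.0)
    prev = math.atan2(vy, vx)
    total = 0.0
    for i in range(1, samples + 1):
        t = 2 * math.pi * i / samples
        vx, vy = field(radius * math.cos(t), radius * math.sin(t))
        cur = math.atan2(vy, vx)
        # Wrap the increment into [-pi, pi) so that branch cuts of atan2 do not count.
        step = (cur - prev + math.pi) % (2 * math.pi) - math.pi
        total += step
        prev = cur
    return round(total / (2 * math.pi))

# Model fields (assumed representatives of each type).
model_fields = {
    "source": lambda x, y: (x, y),
    "sink": lambda x, y: (-x, -y),
    "circulation": lambda x, y: (-y, x),
    "saddle": lambda x, y: (x, -y),
    "spiral": lambda x, y: (x - y, x + y),
    "dipole": lambda x, y: (x * x - y * y, 2 * x * y),
}
indices = {name: index_at_origin(field) for name, field in model_fields.items()}
```

For these model fields the computed indices are $+1$ for the source, sink, circulation, and spiral, $-1$ for the saddle, and $+2$ for the dipole.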

We can now state the first result, a striking connection between vector fields and 
topology. 

Theorem 9.4 (Poincaré-Hopf index theorem) Let $M$ be a compact manifold
without boundary, and let $X$ be a vector field on $M$ with only isolated zeros. Then

$$\sum_p \operatorname{ind}(X, p) = \chi(M),$$

where $p$ ranges over all zeros of $X$ and $\chi(M)$ is the Euler characteristic of $M$.

We refer the reader to [33] or [58] for a proof. As a simple application, observe 
that the theorem implies the difficult part of the hairy ball theorem, because all 
even-dimensional spheres have Euler characteristic equal to 2 (cf. Example 5.6), 
so every vector field on such a sphere must vanish somewhere. Incidentally, the 
Poincare-Hopf theorem is itself a special case of a very far-reaching theorem, 
called the Lefschetz-Hopf theorem, which relates the Lefschetz number of a 
smooth map $f$ (defined in Exercise 5.12) to something called the total fixed point
index of $f$. For more on this, see [30].


8 Note that the index of a vector field $X$, as we have just defined it, is the degree of a certain map $\psi$. But the
degree of a map is defined as the sum of the indices of the map. In this context, the index is the sign of the
Jacobian at a preimage of a regular value of the map. If there is only one preimage of the chosen regular value,
then the index of the vector field $X$ is just the index of the map $\psi$. This explains the use of the word index for
both concepts.
The second, absolutely fundamental, result reveals a deep connection between
the curvature of a manifold and its topology. In its simplest form, it relates the
Gaussian curvature $K$ of a two-dimensional manifold to its Euler characteristic
$\chi(M)$.


Theorem 9.5 (Gauss-Bonnet) Let $M$ be a compact, oriented, two-dimensional
Riemannian manifold without boundary. Then

$$\int_M K\, dA = 2\pi \chi(M),$$

where $dA$ is the area element of $M$.
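
The theorem can be checked numerically on a surface whose curvature is not constant. The sketch below is an illustration only. It uses the spheroid $x = a \sin u \cos v$, $y = a \sin u \sin v$, $z = c \cos u$, for which the standard formulas for a surface of revolution give $K = c^2/E^2$ and $dA = a \sin u \sqrt{E}\, du\, dv$ with $E = a^2 \cos^2 u + c^2 \sin^2 u$. These formulas are quoted as assumptions rather than derived in the text, and the function name is hypothetical. A spheroid is topologically a sphere, so the expected total is $2\pi \cdot 2 = 4\pi$.

```python
import math

def total_curvature_spheroid(a, c, samples=4000):
    """Midpoint-rule approximation of the integral of K dA over a spheroid.

    a is the equatorial radius and c the polar radius. The integrand does not
    depend on the longitude v, so that integral contributes a factor 2*pi.
    """
    h = math.pi / samples
    total = 0.0
    for i in range(samples):
        u = (i + 0.5) * h
        E = (a * math.cos(u)) ** 2 + (c * math.sin(u)) ** 2
        K = c * c / (E * E)                       # Gaussian curvature at colatitude u
        area_density = a * math.sin(u) * math.sqrt(E)
        total += K * area_density * h
    return 2 * math.pi * total

round_sphere = total_curvature_spheroid(1.0, 1.0)
flattened = total_curvature_spheroid(2.0, 0.5)
```

Both values come out close to $4\pi$, even though the curvature of the flattened spheroid varies strongly between equator and poles.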


The theorem was known to Gauss and was published by Bonnet in 1848. 
Although it was first proved using more elementary methods, one can prove it, 
as well as a generalization to $2n$-dimensional submanifolds of $\mathbb{R}^{2n+1}$, using some
of the fancy machinery at our disposal. Very roughly, one embeds M in Euclidean 
space and shows that the left-hand side of the above equation is the degree of a cer- 
tain map, called the Gauss map (see Appendix H). The degree of the Gauss map 
is then related to the index of a certain vector field, to which the Poincare-Hopf 
theorem is then applied. (For the details see [33].) 

An intrinsic proof of the generalized Gauss-Bonnet theorem was discovered by
Chern in the 1940s. To state that generalization we need one last definition. Let $Q$
be a skew symmetric $2n \times 2n$ matrix. The skew symmetry of $Q$ ensures that the
determinant of $Q$ is a perfect square, and $\operatorname{pf} Q$, the Pfaffian of $Q$, is defined to be
a square root of it:

$$\operatorname{pf} Q = \sqrt{\det Q}. \tag{9.4}$$

The sign of the root is fixed by the explicit formula

$$\operatorname{pf} Q = \frac{1}{2^n n!} \sum_{\sigma \in S_{2n}} (-1)^{\sigma}\, Q_{\sigma(1)\sigma(2)} \cdots Q_{\sigma(2n-1)\sigma(2n)}. \tag{9.5}$$

It can be shown that, for any matrix $A$, $\operatorname{pf}(A^T Q A) = (\det A)(\operatorname{pf} Q)$, so the Pfaffian
is invariant under (special) orthogonal transformations.
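
For small matrices the sum (9.5) can be evaluated by brute force, which makes it easy to test both $(\operatorname{pf} Q)^2 = \det Q$ and the transformation rule $\operatorname{pf}(A^T Q A) = (\det A)(\operatorname{pf} Q)$ on an example. The sketch below is an illustration only. The helper names and the particular integer matrices `Q` and `A` are arbitrary choices, and the permutation sums are far too slow for anything but tiny matrices.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def perm_sign(p):
    """Sign of a permutation of range(len(p)), found by sorting it with transpositions."""
    p = list(p)
    sign = 1
    for i in range(len(p)):
        while p[i] != i:
            j = p[i]
            p[i], p[j] = p[j], p[i]
            sign = -sign
    return sign

def pfaffian(Q):
    """Pfaffian of a skew symmetric 2n x 2n matrix via the permutation sum (9.5)."""
    m = len(Q)
    n = m // 2
    total = 0
    for p in permutations(range(m)):
        term = perm_sign(p)
        for a in range(n):
            term *= Q[p[2 * a]][p[2 * a + 1]]
        total += term
    return Fraction(total, 2 ** n * factorial(n))

def determinant(A):
    """Determinant via the Leibniz permutation sum (only sensible for small matrices)."""
    total = 0
    for p in permutations(range(len(A))):
        term = perm_sign(p)
        for i in range(len(A)):
            term *= A[i][p[i]]
        total += term
    return total

def congruence(A, Q):
    """The matrix A^T Q A."""
    m = len(Q)
    return [[sum(A[k][i] * Q[k][l] * A[l][j] for k in range(m) for l in range(m))
             for j in range(m)] for i in range(m)]

Q = [[0, 1, 2, 3], [-1, 0, 4, 5], [-2, -4, 0, 6], [-3, -5, -6, 0]]
A = [[1, 2, 0, 0], [0, 1, 0, 1], [1, 0, 1, 0], [0, 0, 2, -1]]
pf_Q = pfaffian(Q)
pf_transformed = pfaffian(congruence(A, Q))
```

Here `A` has negative determinant, so `pf_transformed` has the opposite sign to `pf_Q`. This shows that the Pfaffian is a signed square root of the determinant, not its positive root.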

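EXERCISE 9.8 Define a 2-form $\lambda := \frac{1}{2} Q_{ij}\, \theta^i \wedge \theta^j$ in $\mathbb{R}^{2n}$. Show that

$$\lambda^n = \underbrace{\lambda \wedge \cdots \wedge \lambda}_{n \text{ times}} = n!\, (\operatorname{pf} Q)\, \sigma,$$

where $\sigma = \theta^1 \wedge \cdots \wedge \theta^{2n}$ is the volume form. Use this expression to prove that
$\operatorname{pf}(A^T Q A) = (\det A)(\operatorname{pf} Q)$. Hint for the second part: Consider what happens to $\lambda$
when $\theta \mapsto \theta' = A^{-1}\theta$.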
Let $\Omega$ be the matrix of curvature 2-forms associated to the Levi-Civita connection
of a Riemannian manifold $M$ of dimension $2n$. As observed in Exercise 8.6,
the matrix $\Omega = (\Omega_{ab})$ is skew symmetric in an orthonormal basis, so we may plug
it into the above definition to get a $2n$-form $\operatorname{pf} \Omega$ (where the ordinary product is
replaced by the wedge product). Under a change of frame, $\Omega$ changes by a special
orthogonal transformation (see (7.32)), so $\operatorname{pf} \Omega$ is a well-defined (global) $2n$-form
on $M$.

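Example 9.2 Although it was constructed using an orthonormal basis, the Pfaffian
$\operatorname{pf} \Omega$ is globally defined so it cannot depend on the basis. This is made evident by
rewriting (9.5) in terms of the Levi-Civita tensor:

$$\operatorname{pf} \Omega = \frac{1}{2^n n!}\, \epsilon^{i_1 i_2 \dots i_{2n-1} i_{2n}}\, \Omega_{i_1 i_2} \wedge \cdots \wedge \Omega_{i_{2n-1} i_{2n}}. \tag{9.6}$$

This expression, in turn, can be written in terms of the Riemann tensor using $\Omega_{\hat a \hat b} =
\frac{1}{2} R_{\hat a \hat b \hat c \hat d}\, \theta^{\hat c} \wedge \theta^{\hat d}$. In particular, in dimensions 2 and 4 we have

$$\operatorname{pf} \Omega = \begin{cases} R_{\hat 1 \hat 2 \hat 1 \hat 2}\, \sigma, & \text{if } n = 1, \text{ and} \\ \frac{1}{8}\left( R_{\hat a \hat b \hat c \hat d} R^{\hat a \hat b \hat c \hat d} - 4 R_{\hat a \hat b} R^{\hat a \hat b} + R^2 \right) \sigma, & \text{if } n = 2, \end{cases}$$

where, as always, $\sigma$ is the volume form and the carets indicate an orthonormal frame.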

An argument similar to the one given in Exercise 7.9 shows that $\operatorname{pf} \Omega$, and therefore
$\operatorname{eu} \Omega := (\operatorname{pf} \Omega)/(2\pi)^n$, defines a cohomology class in $H^{2n}(M)$. The class
$\operatorname{eu} \Omega$ is called the Euler class of E because of the following theorem.

Theorem 9.6 (Chern-Gauss-Bonnet [16]) Let $M$ be a compact, oriented, $2n$-dimensional
Riemannian manifold without boundary, and let $\Omega$ be the curvature
2-form of its Levi-Civita connection. Then

$$\int_M \operatorname{eu} \Omega = \chi(M),$$

where $\chi(M)$ is the Euler characteristic of $M$.

This seems a fitting place to end our brief tour of the geometry and topology of 
manifolds. 
The theorem applies without change (although with a different proof) to pseudo-Riemannian manifolds [17].
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A.l Sets and maps 

A set $X$ is a collection of objects, which we call elements. We write $x \in X$ if $x$ is
an element of $X$. We also write $X = \{x, y, \dots\}$ to denote the elements of $X$. The
empty set $\emptyset = \{\}$ is the unique set containing no elements. A set $U$ is a subset of
$X$, written $U \subseteq X$, if $x \in U$ implies $x \in X$. A set $U$ is a proper subset of $X$,
written $U \subset X$, if $U \subseteq X$ and $U \neq X$. The union $X \cup Y$ of two sets $X$ and $Y$ is
the set of all elements in $X$ or in $Y$ (or in both), whereas the intersection $X \cap Y$
is the set of all elements that are in both $X$ and $Y$. If $X \cap Y \neq \emptyset$ then $X$ meets
$Y$ (and $Y$ meets $X$). The (set-theoretic) difference of two sets $X$ and $Y$ is the set
$X \setminus Y = X - Y = \{x \in X : x \notin Y\}$. (This definition does not require that $Y$ be
contained in $X$.) The complement of $U \subseteq X$ is $\bar U := X - U$. For any collection
$\{U_i\}$ of subsets of $X$, de Morgan’s laws hold:



$$\overline{\bigcup_i U_i} = \bigcap_i \bar U_i, \tag{A.1}$$

$$\overline{\bigcap_i U_i} = \bigcup_i \bar U_i. \tag{A.2}$$
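De Morgan's laws are easy to check on finite sets. The following is a minimal Python sketch; the ambient set and the three subsets are illustrative choices of ours, not taken from the text.

```python
# Check de Morgan's laws (A.1) and (A.2) for a small family of subsets of X.
X = set(range(10))                                      # ambient set (illustrative)
family = [{0, 1, 2, 3}, {2, 3, 4, 5}, {3, 5, 7, 9}]     # the subsets U_i (illustrative)


def complement(U, ambient=X):
    """Complement of U relative to the ambient set."""
    return ambient - U


union_all = set().union(*family)
intersection_all = set(X).intersection(*family)
complements = [complement(U) for U in family]

# (A.1): the complement of a union is the intersection of the complements.
law_1 = complement(union_all) == set(X).intersection(*complements)
# (A.2): the complement of an intersection is the union of the complements.
law_2 = complement(intersection_all) == set().union(*complements)

print(law_1, law_2)
```

A finite check like this is of course not a proof, but it is a convenient way to catch a misremembered identity.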


A map $\varphi$ from a set $X$ to a set $Y$, written $\varphi : X \to Y$, assigns a unique point
$y \in Y$ to each point $x \in X$. We write $\varphi(x) = y$ or $x \mapsto y$ to denote this assignment,
and we call $X$ the domain of $\varphi$ and $Y$ the target of $\varphi$. If $\varphi(x) = y$ for some $x \in X$
and $y \in Y$ then $y$ is the image of $x$ and $x$ is a preimage or inverse image of $y$.
The set of all preimages of $y$ is denoted $\varphi^{-1}(y)$. The image or range of $\varphi$ is the
set of all points in the target that are the image of some point in the domain.

A map $\varphi : X \to Y$ between two sets $X$ and $Y$ is said to be injective or one-to-one
or into if $\varphi(x_1) = \varphi(x_2)$ implies $x_1 = x_2$. A map $\varphi$ is surjective or onto if,
for every $y \in Y$, there exists an $x \in X$ such that $\varphi(x) = y$. A map $\varphi$ is bijective if
it is both injective and surjective. (See Figure A.1.)

Figure A.1 Types of map.

If there is a bijection between X and Y then the two sets have the same number 
of elements. We say they have the same cardinality. The cardinality of a set X 
is denoted |X|. Sets can have finite or infinite cardinality. The natural numbers 
are the elements of N = {1, 2, . . . }. If an infinite set has the same cardinality 
as the natural numbers it is called countably infinite (or simply countable or 
denumerable); otherwise, it is uncountably infinite (or simply uncountable). 

Given a map $\varphi : X \to Y$ and a map $\psi : Y \to Z$, the composition of $\psi$ and $\varphi$,
written $\psi \circ \varphi$, is a map from $X$ to $Z$ given by $(\psi \circ \varphi)(x) := \psi(\varphi(x))$. (Note that
the composition $\varphi \circ \psi$ is not defined in general.) The inverse of a map $\varphi : X \to Y$ is the map
$\varphi^{-1} : Y \to X$ satisfying $\varphi \circ \varphi^{-1} = \varphi^{-1} \circ \varphi = \mathrm{id}$, where id denotes the identity
map, given by $\mathrm{id}(x) = x$ for all $x$. Only bijective maps have inverses.¹
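For finite sets these notions can be tested directly. The sketch below models a map as a Python dictionary; the helper names (`is_injective`, `is_surjective`, `compose`, `inverse`) and the sample maps are our own illustrative choices.

```python
# A map between finite sets is modelled as a dict {x: phi(x)}.

def is_injective(phi):
    """phi(x1) == phi(x2) implies x1 == x2."""
    values = list(phi.values())
    return len(values) == len(set(values))


def is_surjective(phi, target):
    """Every point of the target is the image of some point of the domain."""
    return set(phi.values()) == set(target)


def compose(psi, phi):
    """Return psi o phi, i.e. the map x -> psi(phi(x))."""
    return {x: psi[phi[x]] for x in phi}


def inverse(phi, target):
    """Inverse of a bijection; raises if phi is not bijective."""
    if not (is_injective(phi) and is_surjective(phi, target)):
        raise ValueError("only bijective maps have inverses")
    return {y: x for x, y in phi.items()}


Y = {"a", "b", "c"}
Z = {10, 20, 30}
phi = {1: "b", 2: "c", 3: "a"}        # a bijection X -> Y with X = {1, 2, 3}
psi = {"a": 10, "b": 20, "c": 10}     # Y -> Z, neither injective nor surjective

psi_after_phi = compose(psi, phi)              # a map X -> Z
identity_on_X = compose(inverse(phi, Y), phi)  # phi^{-1} o phi = id
print(psi_after_phi, identity_on_X)
```

Note that `compose(phi, psi)` would fail with a `KeyError`, mirroring the fact that the composition in the other order is not defined here.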

An equivalence relation on a set $X$ is a binary relation, denoted $\sim$, satisfying
the following three properties. For all $x, y, z \in X$ we have

(1) (reflexivity) $x \sim x$

(2) (symmetry) $x \sim y$ implies $y \sim x$

(3) (transitivity) $x \sim y$ and $y \sim z$ implies $x \sim z$.

Given $x \in X$, the equivalence class of $x$, often denoted $\bar x$ or $[x]$, is the set of all
elements of $X$ equivalent to $x$.

Example A.1 The integers are the elements of the set $\{\dots, -2, -1, 0, 1, 2, \dots\}$.
Two integers $a$ and $b$ are equivalent modulo (or mod) $n$ if the integer $n$ divides
$a - b$ without remainder. In that case we write $a \equiv b \bmod n$. Equivalence modulo $n$
is easily seen to be an equivalence relation on the integers. For example, if $n = 5$ then
$12 \equiv 7 \bmod 5$. Modulo 5, the equivalence class $[3] = \{\dots, -7, -2, 3, 8, 13, \dots\}$.


1 The notation is somewhat ambiguous, as we have also denoted the preimage of a point $y$ under a set map
$\varphi : X \to Y$ by $\varphi^{-1}(y)$, and this makes sense even if $\varphi$ is not bijective. The meaning should always be clear
from the context.

A partition of $X$ is a collection of subsets $\{X_1, X_2, \dots, X_k\}$ of $X$, called the
parts of the partition, such that every element of $X$ is in one and only one part. A
partition of $X$ is the same thing as an equivalence relation on $X$ in which the parts
of the partition are the equivalence classes of the equivalence relation.

Example A.2 The integers modulo $n$, denoted $\mathbb{Z}_n$, are partitioned into $n$ equivalence
classes $\{[0], [1], \dots, [n-1]\}$.
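The correspondence between equivalence relations and partitions can be seen concretely. The following Python sketch groups a finite window of integers into classes modulo 5; the function name `equivalence_classes` and the chosen window are assumptions made for illustration.

```python
def equivalence_classes(elements, equivalent):
    """Group elements into classes under the relation `equivalent`.

    Assumes `equivalent` really is an equivalence relation, so comparing
    against one representative per class is enough.
    """
    classes = []
    for x in elements:
        for cls in classes:
            if equivalent(x, cls[0]):
                cls.append(x)
                break
        else:
            classes.append([x])
    return classes


n = 5
window = range(-7, 14)   # a finite sample of the integers (illustrative)
classes = equivalence_classes(window, lambda a, b: (a - b) % n == 0)

for cls in classes:
    print(cls)
```

Each integer in the window lands in exactly one class, so the classes form a partition of the window, and there are exactly $n$ of them.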

An ordered set is a set together with a choice of order for the elements. Parentheses
are used to distinguish ordered sets from unordered sets. For example,
$(z, x, y)$ denotes an ordered set whose underlying set is $\{x, y, z\}$. The Cartesian
product of two sets $X$ and $Y$, denoted $X \times Y$, is the set of all ordered pairs $(x, y)$
with $x \in X$ and $y \in Y$. We can iterate this construction to form the Cartesian product
of as many sets as we like. For finite sets, $|X \times Y| = |X||Y|$.

A.2 Groups 

A nonempty set $G$ together with a binary operation (often denoted by “+” or
“·” or simply by juxtaposition and called the group operation) is said to be a group
provided the following axioms hold:

(1) $a, b \in G$ implies that $a \cdot b \in G$;

(2) $a, b, c \in G$ implies that $a \cdot (b \cdot c) = (a \cdot b) \cdot c$;

(3) there exists an element $e \in G$ such that $a \cdot e = e \cdot a = a$ for all $a \in G$;

(4) for every $a \in G$ there exists an element $a^{-1} \in G$ such that $a \cdot a^{-1} = a^{-1} \cdot a = e$.

Axiom (1) says that $G$ is closed under the group operation. Axiom (2) says that the
group operation is associative. Axiom (3) is the axiom of the identity ($e$ is called the
identity element of the group). Axiom (4) is the axiom of inverses, and says that every
element in the group has an inverse.

Example A.3 Let $G = \mathbb{Z}$, where the group operation is given by ordinary addition.
To verify that $\mathbb{Z}$ is a group we must verify that the axioms are satisfied. Well, certainly
the integers are closed under addition: the sum of any two integers is again an integer.
Ordinary addition is associative, so axiom (2) is satisfied. The identity element is 0,
because $a + 0 = 0 + a = a$ for any integer $a$. Finally, the inverse element of $a$ is just
$-a$, because $a + (-a) = (-a) + a = 0$.

Example A.4 Let $G = \{-1, 1\}$, where the group operation is now ordinary
multiplication. Clearly $G$ is closed under multiplication. Ordinary multiplication is
associative, so axiom (2) is satisfied. The identity element is 1, because $a \cdot 1 =
1 \cdot a = a$ for any $a \in G$. Finally, each element is its own inverse. This group is
usually denoted $\mathbb{Z}_2$.

Example A.5 Let $G = \{e, a, a^2, \dots, a^{n-1}\}$ with the group operation denoted
by multiplication. The formal symbol $a$ satisfies the identity $a^n = e$. The group
axioms are easily verified. This group is called the cyclic group of order $n$ and is
usually denoted $C_n$.
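For a finite set the four group axioms can be checked by brute force. The sketch below is one way to do so in Python; the function `is_group` is our own helper, and modelling $C_6$ by exponents added modulo 6 is an assumption made for illustration.

```python
from itertools import product


def is_group(elements, op):
    """Brute-force check of the group axioms (1)-(4) on a finite set."""
    elements = list(elements)
    # (1) closure
    if any(op(a, b) not in elements for a, b in product(elements, repeat=2)):
        return False
    # (2) associativity
    if any(op(a, op(b, c)) != op(op(a, b), c)
           for a, b, c in product(elements, repeat=3)):
        return False
    # (3) existence of an identity element
    identities = [e for e in elements
                  if all(op(a, e) == a == op(e, a) for a in elements)]
    if not identities:
        return False
    e = identities[0]
    # (4) existence of inverses
    return all(any(op(a, b) == e == op(b, a) for b in elements)
               for a in elements)


# Example A.4: {-1, 1} under multiplication.
z2_ok = is_group([-1, 1], lambda a, b: a * b)
# Example A.5 with n = 6: a^j * a^k = a^{(j+k) mod 6}, tracked by exponents.
c6_ok = is_group(range(6), lambda j, k: (j + k) % 6)
# A non-example: {0,...,5} under multiplication mod 6 (0 has no inverse).
not_a_group = is_group(range(6), lambda j, k: (j * k) % 6)

print(z2_ok, c6_ok, not_a_group)
```

The check is cubic in the size of the set because of associativity, so it is only practical for small examples.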

Example A.6 Let $X = \{1, 2, \dots, n\}$ be a set of $n$ objects. A permutation of $X$ is
a bijection $\sigma : X \to X$. Permutations can be represented in many different ways,
but the simplest is just to write down the images of the numbers 1 to $n$ in a linear
order. So, for example, if $\sigma(1) = 2$, $\sigma(2) = 4$, $\sigma(3) = 3$, and $\sigma(4) = 1$ then
we write $\sigma = 2431$. Let $G$ be the set of all permutations of $X$, where the group
operation is composition. If $\sigma$ and $\tau$ are permutations we write $\sigma\tau$ for $\sigma \circ \tau$. That is,
$(\sigma\tau)(i) = \sigma(\tau(i))$. For example, if $\tau(1) = 4$, $\tau(2) = 2$, $\tau(3) = 3$, and $\tau(4) = 1$ then
$\tau = 4231$ and $\sigma\tau = 1432$. Evidently, $G$ is closed under composition (which is also
associative). The identity element of $G$ is just the identity map, given by $\sigma(i) = i$ for
all $i$. For example, the identity permutation of $\mathfrak{S}_4$ is $e = 1234$. Finally, the inverse
of a permutation $\sigma$ is just the inverse map $\sigma^{-1}$, which satisfies $\sigma\sigma^{-1} = \sigma^{-1}\sigma = e$.
For example, if $\sigma = 2431$ then $\sigma^{-1} = 4132$. The collection of all permutations of $X$
is called the symmetric (or permutation) group on $n$ elements and is denoted $\mathfrak{S}_n$.
It contains $n!$ elements.
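The examples above can be reproduced mechanically. In the following Python sketch a permutation in one-line notation is stored as a tuple, so that 2431 becomes `(2, 4, 3, 1)`; the helper names are our own.

```python
from itertools import permutations


def compose(sigma, tau):
    """Product sigma tau, defined by (sigma tau)(i) = sigma(tau(i))."""
    return tuple(sigma[tau[i] - 1] for i in range(len(tau)))


def inverse(sigma):
    """Inverse permutation: if sigma(i) = j then inverse(sigma)(j) = i."""
    inv = [0] * len(sigma)
    for i, image in enumerate(sigma, start=1):
        inv[image - 1] = i
    return tuple(inv)


sigma = (2, 4, 3, 1)   # sigma = 2431
tau = (4, 2, 3, 1)     # tau = 4231
identity = (1, 2, 3, 4)

sigma_tau = compose(sigma, tau)
sigma_inv = inverse(sigma)
order_of_S4 = len(list(permutations(range(1, 5))))

print(sigma_tau, sigma_inv, order_of_S4)
```

Storing the images 1-based, as in the text, costs a few `- 1` offsets but keeps the tuples identical to the one-line notation.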

A transposition is a permutation that switches two numbers and leaves the rest
fixed. For example, the permutation 4231 is a transposition because it flips 1 and
4 and leaves 2 and 3 alone. For simplicity one usually denotes this transposition by
$1 \leftrightarrow 4$. It is not too difficult to see that $\mathfrak{S}_n$ is generated by transpositions. This means
that any permutation $\sigma$ can be written as the product of transpositions.

A permutation $\sigma$ is even if it can be expressed as the product of an even number of
transpositions; otherwise it is odd. The sign of a permutation $\sigma$, written $(-1)^\sigma$, is
$+1$ if it is even and $-1$ if it is odd. One can show that the sign of a permutation is
$(-1)^k$, where $k$ is the number of transpositions required to transform it back to the
identity permutation. So 2431 is an even permutation (sign $+1$) because we can get back
to the identity permutation in two steps: $2431 \xrightarrow{1 \leftrightarrow 2} 1432 \xrightarrow{2 \leftrightarrow 4} 1234$.
Although a given permutation $\sigma$ can be written in many different ways as a product of
transpositions, it turns out that the sign of $\sigma$ is always the same. Furthermore, as the
notation is meant to suggest, $(-1)^{\sigma\tau} = (-1)^\sigma (-1)^\tau$. Both these claims require proof,
which we omit.
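Although the two claims are not proved here, they can at least be checked exhaustively for small $n$. The Python sketch below computes the sign by sorting a permutation back to the identity with swaps and counting them; the function names are ours, and the exhaustive check over $\mathfrak{S}_4$ is only evidence, not a proof.

```python
from itertools import permutations


def compose(sigma, tau):
    """Product sigma tau, defined by (sigma tau)(i) = sigma(tau(i))."""
    return tuple(sigma[tau[i] - 1] for i in range(len(tau)))


def sign(sigma):
    """Return (-1)^k, where k counts the swaps used to reach the identity."""
    perm = list(sigma)
    swaps = 0
    for i in range(len(perm)):
        # Move the entry at position i to its home slot until i holds i + 1.
        while perm[i] != i + 1:
            j = perm[i] - 1
            perm[i], perm[j] = perm[j], perm[i]   # one transposition
            swaps += 1
    return 1 if swaps % 2 == 0 else -1


sign_2431 = sign((2, 4, 3, 1))   # even permutation
sign_4231 = sign((4, 2, 3, 1))   # a single transposition

S4 = list(permutations(range(1, 5)))
multiplicative = all(sign(compose(s, t)) == sign(s) * sign(t)
                     for s in S4 for t in S4)

print(sign_2431, sign_4231, multiplicative)
```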

Example A.7 The set $SO(n)$ of all rotations in $\mathbb{R}^n$ (endomorphisms of $\mathbb{R}^n$ that
preserve the Euclidean inner product and have unit determinant) is a subgroup (under
composition) of the orthogonal group $O(n)$. The set $SO(n)$ is called the special
orthogonal group.

Remark Examples A.4-A.6 are finite groups, while Examples A.3 and A.7 are infinite
groups.



A group $G$ is said to be abelian or commutative if $a \cdot b = b \cdot a$ for every pair
of elements $a$, $b$ in $G$. The groups $\mathbb{Z}$, $\mathbb{Z}_2$, and $C_n$ are all abelian, whereas $SO(3)$ is
nonabelian.

Example A.8 For $n > 2$, $\mathfrak{S}_n$ is nonabelian. For example, if $n = 3$ and $\sigma = 132$
and $\tau = 213$ then $\sigma\tau = 312$ whereas $\tau\sigma = 231$. This implies the claim, as $\mathfrak{S}_n$
is naturally a subgroup (a subset that is also a group in its own right) of $\mathfrak{S}_m$ when
$n < m$.

Next we introduce a crucial concept in the theory of algebraic structures. A map
$\varphi : G \to H$ from a group $G$ to a group $H$ satisfying $\varphi(gh) = \varphi(g)\varphi(h)$ is called
a (group) homomorphism.

Remark In the above definition, the first product $gh$ is the product in $G$, while
the second product $\varphi(g)\varphi(h)$ is the product in $H$.

Remark The word “homomorphism” is from the Greek roots ὁμός, meaning
similar or akin, and μορφή, meaning shape or form. Two groups are homomorphic
if there is a homomorphism between them. In that case they are indeed similar
in form, because the homomorphism establishes a link between the two group
structures.

Example A.9 The map $\varphi : C_3 \to \mathfrak{S}_3$ taking $a$ to 231 and extended homomorphically
to the rest of $C_3$ is a homomorphism. (“Extended homomorphically” means
that we require the map to act homomorphically on the entire group, subject to the
initial constraint given for the image of $a$; in our case this means that we require $\varphi$ to
satisfy the condition $\varphi(a^m) = \varphi(a)^m$ for all $m$.)

Example A.10 The map $\varphi : \mathbb{Z} \to \mathbb{Z}_2$ taking every even integer to $+1$ and every
odd integer to $-1$ is a homomorphism, because $\varphi(a + b) = \varphi(a)\varphi(b)$ for all $a$ and
$b$. (Recall that the group “product” in $\mathbb{Z}$ is just addition.)
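Both examples can be spot-checked numerically. In the Python sketch below, elements of $C_3$ are represented by their exponents modulo 3 and permutations by tuples in one-line notation; these encodings and the function names are our own assumptions.

```python
def compose(sigma, tau):
    """Product sigma tau, defined by (sigma tau)(i) = sigma(tau(i))."""
    return tuple(sigma[tau[i] - 1] for i in range(len(tau)))


def power(sigma, m):
    """m-fold product of sigma with itself (the identity when m == 0)."""
    result = tuple(range(1, len(sigma) + 1))
    for _ in range(m):
        result = compose(sigma, result)
    return result


# Example A.10: Z -> Z_2 = {+1, -1}, even -> +1 and odd -> -1.
def parity_map(a):
    return 1 if a % 2 == 0 else -1


parity_is_hom = all(parity_map(a + b) == parity_map(a) * parity_map(b)
                    for a in range(-10, 11) for b in range(-10, 11))


# Example A.9: C_3 -> S_3 with a -> 231, extended by phi(a^m) = phi(a)^m.
def phi_c3(m):
    return power((2, 3, 1), m % 3)


c3_is_hom = all(phi_c3((m + k) % 3) == compose(phi_c3(m), phi_c3(k))
                for m in range(3) for k in range(3))

print(parity_is_hom, c3_is_hom)
```

The first check samples only a finite range of integers; the second is exhaustive because $C_3$ has three elements.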

Let $G$ and $H$ be groups. A homomorphism $\varphi : G \to H$ is called an isomorphism
if $\varphi$ is a bijection. Groups $G$ and $H$ are said to be isomorphic if there is
an isomorphism between them. Isomorphic groups are essentially the same (the
Greek word ἴσος means “equal”). A homomorphism $\varphi : G \to G$ from a group
to itself is called an endomorphism. A bijective endomorphism $\varphi : G \to G$ is
called an automorphism. An endomorphism is not necessarily one-to-one or onto,
which means it could map $G$ to part of itself in a funny way. An automorphism is
just about the most innocuous sort of map around - it basically permutes the group
elements.

Example A.11 Let $G$ be $C_3$, and consider the map $\varphi : C_3 \to C_3$ given by $\varphi(a) = a^2$
and extended homomorphically. The map $\varphi$ is an automorphism.

Given a group $G$ and a space $X$, a (left) action of $G$ on $X$ is a map $G \times X \to X$
given by $(g, x) \mapsto gx$ and satisfying two axioms:

(1) $ex = x$ for all $x \in X$;

(2) $(gh)x = g(hx)$ for all $g, h \in G$ and $x \in X$.

If $gx = x$ for some $x \in X$ implies $g = e$ then the action of $G$ on $X$ is free.

A.3 Rings and fields 

Next we need the idea of a field. The intuitive definition of a field is a set of elements
(called scalars) that possess the same kinds of properties as do the real
numbers. So, for example, any three elements $a$, $b$, and $c$ in a field satisfy the
distributive law: $a \cdot (b + c) = a \cdot b + a \cdot c$.

More precisely, a nonempty set $R$ together with two operations, addition and
multiplication, denoted $+$ and $\cdot$ respectively, is said to be an associative ring if, for
all $a$, $b$, and $c$ in $R$, the following hold:

(1) $a + b \in R$;

(2) $a + b = b + a$;

(3) $(a + b) + c = a + (b + c)$;

(4) there is an element $0 \in R$ such that for every $a \in R$ we have $a + 0 = a$;

(5) for every $a \in R$ there is an element $-a \in R$ such that $a + (-a) = 0$;

(6) $a \cdot b$ is in $R$;

(7) $a \cdot (b \cdot c) = (a \cdot b) \cdot c$;

(8) $a \cdot (b + c) = a \cdot b + a \cdot c$ and $(b + c) \cdot a = b \cdot a + c \cdot a$;

(9) there exists an element $1 \in R$ such that for every $a \in R$ we have
$a \cdot 1 = 1 \cdot a = a$.

Axioms (1) through (5) say that R is an abelian group under addition. Axiom (6) 
says that R is also closed under multiplication. Axiom (7) says the multiplication 
is associative. 2 Axiom (8) just says that addition and multiplication are “compati- 
ble” - i.e., they satisfy the usual distributive axioms. If Axiom (9) is satisfied, we 
say that R is a ring with unit. 

Now, you may have noticed that axioms (6), (7), and (9) are just one axiom shy
of making $R$ into a group under multiplication as well. What is missing are the
multiplicative inverses. If it should be the case that every nonzero element of the
ring has a multiplicative inverse then the ring is called a division ring or skew
field. If the multiplication of the division ring is commutative, so that the nonzero
elements of $R$ form an abelian group under multiplication (in addition to $R$ being an
abelian group under addition), then the structure is called a field.

2 There are examples of nonassociative rings (for example, the octonions) but we shall not consider them here.

Example A. 12 The integers Z with the usual operations of addition and multipli- 
cation form a ring with unit. 

Example A.13 The even integers with the usual operations of addition and 
multiplication form a ring without unit. 

Example A.14 The rational numbers (Q) with the usual operations of addition and 
multiplication form a field, as do the real numbers R. 

Example A.15 Recall how the complex numbers work. We define $i = \sqrt{-1}$. Then
any complex number is of the form $z = a + ib$ where $a$ and $b$ are real numbers (called
the real and imaginary parts of $z$, respectively). We define addition and multiplication
as follows. Let $z = a + ib$ and $w = c + id$. Then $z + w = (a + c) + i(b + d)$ and
$z \cdot w = (a + ib)(c + id) = (ac - bd) + i(ad + bc)$. The set of complex numbers $\mathbb{C}$
with these operations is a field.

The field of complex numbers admits a special linear involution (often denoted by
an asterisk, but here denoted by a bar), namely complex conjugation, which leaves
real numbers invariant and sends $i$ to $-i$. Thus, for example, $\overline{a + ib} = a - ib$. The
modulus of $z \in \mathbb{C}$ is $|z| = (z\bar z)^{1/2}$.

Example A.16 In this example we define the quaternions $Q$, which were first
discovered by Sir William Rowan Hamilton (the same person who gave you the
Hamiltonian). Elements of $Q$ are of the form $q = a + ib + jc + kd$, where $a$, $b$, $c$,
and $d$ are real numbers and $i$, $j$, and $k$ are formal symbols that generalize the $i$ of
the complex numbers. We define the addition of quaternions as we did for complex
numbers, namely the symbols $i$, $j$, and $k$ act as place holders. So, for example, if
$q = a + ib + jc + kd$ and $p = e + if + jg + kh$ then $q + p = (a + e) + i(b + f) +
j(c + g) + k(d + h)$. We define the multiplicative properties of these formal symbols
as follows: $i^2 = j^2 = k^2 = -1$, $ij = k = -ji$, $jk = i = -kj$, and $ki = j = -ik$.
(You may notice a certain resemblance to the cross product.) With these rules, the
quaternions form a classic example of a skew field (for which multiplication is not
commutative).
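The multiplication rules extend bilinearly to all quaternions. The Python sketch below stores $a + ib + jc + kd$ as the tuple `(a, b, c, d)` and implements the resulting product; the tuple encoding and the function names `qmul` and `qconj` are our own choices.

```python
def qmul(p, q):
    """Product of quaternions stored as (a, b, c, d) = a + ib + jc + kd."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1 * a2 - b1 * b2 - c1 * c2 - d1 * d2,
            a1 * b2 + b1 * a2 + c1 * d2 - d1 * c2,
            a1 * c2 - b1 * d2 + c1 * a2 + d1 * b2,
            a1 * d2 + b1 * c2 - c1 * b2 + d1 * a2)


def qconj(q):
    """Quaternion conjugate: flip the sign of the i, j, k parts."""
    a, b, c, d = q
    return (a, -b, -c, -d)


one = (1, 0, 0, 0)
i = (0, 1, 0, 0)
j = (0, 0, 1, 0)
k = (0, 0, 0, 1)

ij, ji = qmul(i, j), qmul(j, i)   # expect k and -k
q = (1, 2, 3, 4)
norm_squared = qmul(q, qconj(q))  # expect (1 + 4 + 9 + 16, 0, 0, 0)

print(ij, ji, norm_squared)
```

Since $q\bar q$ is a positive real number for $q \neq 0$, the element $\bar q/(q\bar q)$ is a multiplicative inverse of $q$, which is the property that makes the quaternions a division ring.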

Example A.17 Let $\mathbb{Z}_p$ be the set of all integers mod $p$. When $p$ is prime, $\mathbb{Z}_p$ is a
field. First notice that $\mathbb{Z}_p$ only has $p$ elements, namely $\{[0], [1], [2], \dots, [p-1]\}$. (For
this reason, $\mathbb{Z}_p$ is called a finite field.) The addition and multiplication of equivalence
classes are all defined modulo $p$. So, for example, let $p = 7$. Then $[4] + [6] = [3]$,
because $(4 + 7x) + (6 + 7y) = 3 + 7(x + y + 1)$. Similarly, $[3] \cdot [4] = [5]$ because
$(3 + 7x)(4 + 7y) = 5 + 7(4x + 3y + 7xy + 1)$. One can show that every class has an
additive inverse and every nonzero class has a multiplicative inverse, so $\mathbb{Z}_p$ forms a field.
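The arithmetic of $\mathbb{Z}_7$ is easy to reproduce. The following Python sketch uses the built-in three-argument `pow` with exponent $-1$ (available from Python 3.8) to find multiplicative inverses; the helper names and the comparison with the composite modulus 6 are our own additions.

```python
def add_mod(a, b, n):
    """Addition of the classes [a] and [b] modulo n."""
    return (a + b) % n


def mul_mod(a, b, n):
    """Multiplication of the classes [a] and [b] modulo n."""
    return (a * b) % n


def has_all_inverses(n):
    """True if every nonzero class modulo n has a multiplicative inverse."""
    for a in range(1, n):
        try:
            pow(a, -1, n)          # raises ValueError if a is not invertible
        except ValueError:
            return False
    return True


p = 7
sum_4_6 = add_mod(4, 6, p)         # [4] + [6]
prod_3_4 = mul_mod(3, 4, p)        # [3] . [4]
inverses = {a: pow(a, -1, p) for a in range(1, p)}

print(sum_4_6, prod_3_4, inverses)
print(has_all_inverses(7), has_all_inverses(6))
```

For the composite modulus 6 the class $[2]$ has no inverse, which is why the primality of $p$ matters.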


A.4 Vector spaces 

At last we come to the formal definition of a vector space. A nonempty set $V$ is
said to be a vector space over a field $\mathbb{F}$ if $V$ is an abelian group under addition, and
for every $a, b \in \mathbb{F}$ and $v, w \in V$ we have:

(1) $av \in V$;

(2) $a(v + w) = av + aw$;

(3) $(a + b)v = av + bv$;

(4) $a(bv) = (ab)v$;

(5) $1v = v$.

Here 1 is the multiplicative identity in $\mathbb{F}$. The elements of $V$ are called vectors
while the elements of $\mathbb{F}$ are called scalars. The quantity $av$ is called a scalar multiple
of $v$. In general, the quantity $av + bw$ for scalars $a, b \in \mathbb{F}$ and vectors $v, w \in V$
is called an $\mathbb{F}$-linear combination (or simply a linear combination if the field $\mathbb{F}$ is
obvious) of $v$ and $w$.

It is worth observing some simple consequences of the axioms. For example, by
axiom (3) we have $0v = (0 + 0)v = 0v + 0v$, so $0v = 0$. Similarly one can show
that $a0 = 0$ and $(-a)v = -(av)$.


Appendix B 
The spectral theorem 


In this appendix we provide a quick proof of the spectral theorem of linear algebra.
To do so at the right level of generality requires that we work over the complex
numbers. Thus, let $V$ be a vector space over the complex numbers equipped with
the usual sesquilinear inner product $(\cdot, \cdot)$ (for brevity we drop explicit mention of
the map $g$). An operator $A$ (respectively, matrix $\mathbf{A}$) is called Hermitian or self-adjoint
if $A = A^\dagger$ (respectively, $\mathbf{A} = \mathbf{A}^\dagger$).¹ An operator $U$ (respectively, matrix
$\mathbf{U}$) is unitary if $U^\dagger = U^{-1}$ (respectively, $\mathbf{U}^\dagger = \mathbf{U}^{-1}$). A matrix $\mathbf{A}$ is diagonalized
by a matrix $\mathbf{U}$ if $\mathbf{U}^{-1}\mathbf{A}\mathbf{U} = \mathbf{D}$ for some diagonal matrix $\mathbf{D}$.

Theorem B.1 (Spectral theorem) Every Hermitian matrix can be diagonalized
by a unitary matrix.

Proof Let $A : V \to V$ be any linear operator. Then $A$ has at least one (nontrivial)
eigenvector $v$, since

$$Av = \lambda v \iff \det(A - \lambda I) = 0.$$

But the latter is a polynomial equation in $\lambda$ of degree $n := \dim V$, which always
has a solution by virtue of the fundamental theorem of algebra.²

Now assume $A$ is Hermitian, and let $W := (\operatorname{span} v)^\perp$ be the orthogonal
complement of the linear span of the eigenvector $v$, with $w \in W$. Then

$$(Aw, v) = (w, Av) = (w, \lambda v) = \lambda(w, v) = 0.$$

In particular, $A$ maps any element of $W$ to another element of $W$ ($W$ is called
an invariant subspace of $A$). Therefore we may repeat the process to find an
eigenvector of $A$ lying in $W$. By induction, we obtain a basis of $V$ consisting
of eigenvectors of $A$. By construction these eigenvectors are orthogonal, and by
normalizing they can be made orthonormal.³

1 Warning: “Hermitian” and “self-adjoint” are distinct notions in infinite dimensions. See e.g. [74].

2 This is why we need to work over the complex numbers. One can develop a proof of the spectral theorem for
real symmetric matrices without going through the complex numbers first, but it takes a little longer and proves
less.

Let $\{e_i\}$ be any orthonormal basis for $V$, and let $\mathbf{A}$ be the matrix representing
$A$ in this basis. Let $\{v_i\}$ be the basis of eigenvectors of $A$ just constructed, with
corresponding eigenvalues $\lambda_i$. Define $U : V \to V$ by $Ue_i = v_i$, $1 \le i \le n$. Then
$U$ is unitary, because

$$(e_i, U^\dagger U e_j) = (Ue_i, Ue_j) = (v_i, v_j) = \delta_{ij} = (e_i, e_j),$$

and therefore $U^\dagger U = I$. Also,

$$(e_i, U^{-1}AUe_j) = (Ue_i, AUe_j) = (v_i, Av_j) = \lambda_j (v_i, v_j) = \lambda_j \delta_{ij}.$$

It follows that if $\mathbf{U}$ is the matrix representing $U$ in the basis $\{e_i\}$ then $\mathbf{U}^{-1}\mathbf{A}\mathbf{U} = \mathbf{D}$,
where $\mathbf{D}$ is a diagonal matrix with the eigenvalues of $A$ along the diagonal. □

Remark As $Ue_j = \sum_i \mathbf{U}_{ij} e_i = v_j$, the columns of the matrix $\mathbf{U}$ are just the
components of the vectors $\{v_i\}$ in the basis $\{e_i\}$.

Remark Note that the entries of the diagonal matrix $\mathbf{D}$ in the theorem are necessarily
real because $A$ is Hermitian: if $v$ is a nontrivial eigenvector with eigenvalue
$\lambda$ then

$$\bar\lambda(v, v) = (Av, v) = (v, Av) = \lambda(v, v),$$

so $\bar\lambda = \lambda$ because $(v, v) > 0$.
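A small worked instance may help fix ideas. The Python sketch below takes one particular $2 \times 2$ Hermitian matrix of our own choosing, whose eigenvalues 3 and 1 and eigenvectors were found by hand, and checks numerically that the matrix of eigenvectors is unitary and diagonalizes it. The helpers `matmul` and `dagger` are ad hoc, not library functions.

```python
import math


def matmul(A, B):
    """Product of two matrices stored as lists of rows."""
    return [[sum(A[i][k] * B[k][j] for k in range(len(B)))
             for j in range(len(B[0]))] for i in range(len(A))]


def dagger(A):
    """Conjugate transpose of a matrix stored as a list of rows."""
    return [[A[j][i].conjugate() for j in range(len(A))]
            for i in range(len(A[0]))]


# An illustrative Hermitian matrix (our choice), with eigenvalues 3 and 1.
A = [[2, 1j], [-1j, 2]]

# Columns of U are the normalized eigenvectors (i, 1)/sqrt(2) and (-i, 1)/sqrt(2).
s = 1 / math.sqrt(2)
U = [[1j * s, -1j * s],
     [s, s]]

UdU = matmul(dagger(U), U)              # should be the identity
D = matmul(dagger(U), matmul(A, U))     # should be diag(3, 1)

for row in D:
    print([complex(round(z.real, 12), round(z.imag, 12)) for z in row])
```

Up to rounding, `D` is diagonal with real entries, in line with the remark that the eigenvalues of a Hermitian matrix are real.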

Corollary B.2 Every real symmetric matrix can be diagonalized by an orthogo- 
nal matrix. 

Proof There are two ways to proceed. Naively, we just restrict everything in the 
previous theorem to real numbers. A real Hermitian matrix is just a real symmetric 
matrix, and a real unitary matrix is just an orthogonal matrix, so the result more 
or less follows. The problem is this: the theorem says that if we start with a real 
symmetric matrix, namely a special kind of Hermitian matrix, there is a unitary 
matrix that diagonalizes it. How do we know that the entries of that unitary matrix 
are real? We need another argument. 

Here is one that works. Start with a real symmetric matrix $A$. By the first part
of the proof of the theorem, it has a nontrivial eigenvector $v$ whose entries may be
complex. So write $v = x + iy$ where $x$ and $y$ are real. Now

$$\lambda(x + iy) = \lambda v = Av = Ax + iAy.$$

But $A$ and $\lambda$ are both real, so equating real and imaginary parts gives

$$Ax = \lambda x \quad\text{and}\quad Ay = \lambda y.$$

We must have $x \neq 0$ or $y \neq 0$, so we have found a nonzero real eigenvector of $A$.
Now proceed as in the proof of the theorem. □

3 In practice, one generally needs Gram-Schmidt orthonormalization. See e.g. [43]. (For the real case, see
Exercise 1.52.)


Appendix C 

Orientations and top-dimensional forms 


Recall that a manifold $M$ with atlas $\{(U_i, \varphi_i)\}$ (see Section 3.4) is oriented if all the
transition functions $\varphi_j \circ \varphi_i^{-1}$ have positive Jacobian determinant, and it is orientable
if it can be oriented. This is not a very useful characterization because there are too
many transition functions to check. Fortunately, we can use the pullback map to
devise a nicer criterion for ascertaining orientability.

Theorem C.1 An $n$-dimensional manifold $M$ is orientable if and only if it has a
global nowhere-vanishing $n$-form.

Remark A global nowhere-vanishing $n$-form on an $n$-dimensional manifold is
called a volume form.

Proof Suppose that $\omega$ is a global nowhere-vanishing $n$-form. Any other $n$-form
on $M$ is a multiple of this one, so $\varphi_i^*(dx^1 \wedge \cdots \wedge dx^n) = f_i\,\omega$ for some smooth
nowhere-vanishing function $f_i$ on $U_i$. We cannot have $f_i > 0$ and $f_j < 0$ for two
overlapping coordinate patches, or else by continuity $f_i$ or $f_j$ would have to be zero
somewhere. Thus, without loss of generality we may choose $f_i > 0$ everywhere.
It follows that, for every pair of patches, the transition functions $\varphi_j \circ \varphi_i^{-1}$ pull
back $dx^1 \wedge \cdots \wedge dx^n$ to a positive multiple of itself. This, in turn, implies that the
Jacobian determinants of the transition functions are all positive.

Conversely, suppose that $M$ has an oriented atlas $\{(U_i, \varphi_i)\}$. Then

$$(\varphi_j \circ \varphi_i^{-1})^*(dx^1 \wedge \cdots \wedge dx^n) = h\, dx^1 \wedge \cdots \wedge dx^n$$

for some positive function $h$. Hence,

$$\varphi_j^*(dx^1 \wedge \cdots \wedge dx^n) = (\varphi_i^* h)\,\varphi_i^*(dx^1 \wedge \cdots \wedge dx^n).$$

Writing $\omega_i$ for $\varphi_i^*(dx^1 \wedge \cdots \wedge dx^n)$ we get $\omega_j = f\omega_i$ for some positive function
$f = \varphi_i^* h$ on $U_i \cap U_j$.

Let $\{\rho_i\}$ be a partition of unity subordinate to the open cover $\{U_i\}$, and define
$\omega = \sum_i \rho_i \omega_i$. At each point in $M$, the forms $\{\omega_i\}$ are positive multiples of each
other whenever they are defined. As $\rho_i \ge 0$ and not all the $\rho_i$ can vanish at a point,
$\omega$ is nowhere vanishing. □

As the proof shows, all global nowhere-vanishing $n$-forms are multiples of each
other by nowhere-vanishing functions. Thus, on a connected manifold, the set of
all such forms can be divided into two equivalence classes, depending on whether
this multiple is positive or negative. Either class determines an orientation class
(or, more simply, orientation) of $M$.


Appendix D 

Riemann normal coordinates 


Let $x^1, \dots, x^n$ be the usual coordinates on $\mathbb{R}^n$. Technically, the corresponding vector
fields $\partial/\partial x^1, \dots, \partial/\partial x^n$ live in $T\mathbb{R}^n$, the tangent space to Euclidean space. But
we always identify $\mathbb{R}^n$ and $T\mathbb{R}^n$, because both are vector spaces. Under the natural
identification, a step in the coordinate direction $x^i$ is the same thing as moving
from the tail to the tip of $\partial/\partial x^i$: this works because $\partial/\partial x^i$ is a unit vector.

There is a natural analogue of this correspondence in the vicinity of a point $p$ on
any Riemannian manifold $M$. Given a tangent vector $X_p$ we define $\operatorname{Exp} X_p$ to be
the point on $M$ obtained by moving a unit parameter distance along the (unique)
geodesic $\gamma$ whose tangent vector at $p$ is $X_p$. The map $\operatorname{Exp} : TM \to M$ is called
the exponential map. (See Figure D.1.)

Prima facie this construction seems to be totally arbitrary, as it appears to depend
on the parameterization. But appearances can be deceiving. According to (8.14),
the distance along the curve corresponding to a unit parameter distance is

$$\int_0^1 \sqrt{g_{ij}(t)\,\frac{dx^i}{dt}\frac{dx^j}{dt}}\;dt = \int_0^1 \sqrt{g(X, X)}\;dt = \sqrt{g(X_p, X_p)}, \tag{D.1}$$

where $X = (dx^i/dt)(\partial/\partial x^i)$ denotes the tangent vector field to $\gamma$. The second
equality holds because the curve is a geodesic and so $g(X, X)$ is constant along the
curve. Hence, moving a unit parameter distance along $\gamma$ corresponds to moving
a distance equal to the length of the vector $X_p$ along $\gamma$, and the latter quantity is
parameterization independent. It follows that we can express the geodesic $\gamma$ itself
in terms of the exponential map:

$$\gamma(t) = \operatorname{Exp} tX_p. \tag{D.2}$$

Figure D.1 The exponential map.

We digress briefly to discuss why Exp is called the exponential map. Let $a \in \mathbb{R}$
and consider the differential operator on smooth functions given by

$$\mathcal{D} = e^{a(d/dx)} = 1 + a\frac{d}{dx} + \frac{a^2}{2!}\frac{d^2}{dx^2} + \cdots. \tag{D.3}$$

By Taylor’s theorem

$$\mathcal{D}f(x) = f(x + a) \tag{D.4}$$

for any smooth function $f$. In this sense, $\mathcal{D}$ can be thought of as a shift operator that
translates every point of $\mathbb{R}$ by $a$. This “exponential map” moves things a distance
$a$ along the “geodesic” whose tangent vector is $d/dx$. In general, if $a \in \mathbb{R}^n$ and

$$\mathcal{D} = e^{a^i(\partial/\partial x^i)} \tag{D.5}$$

then

$$\mathcal{D}f(x) = f(x + a). \tag{D.6}$$

The map $\operatorname{Exp} X_p$ is a kind of analogue on a general manifold of $e^{X_p}$ on Euclidean
space, whence the nomenclature.¹
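For a polynomial the series (D.3) terminates, so (D.4) can be verified exactly. The Python sketch below represents a polynomial by its list of coefficients (constant term first) and applies $e^{a(d/dx)}$ term by term; the representation, the function names, and the sample polynomial are our own illustrative choices.

```python
from fractions import Fraction
from math import factorial


def derivative(coeffs):
    """Derivative of c0 + c1 x + c2 x^2 + ... as a coefficient list."""
    return [k * c for k, c in enumerate(coeffs)][1:]


def evaluate(coeffs, x):
    """Value of the polynomial at x."""
    return sum(c * x ** k for k, c in enumerate(coeffs))


def shift(coeffs, a):
    """Apply exp(a d/dx) = sum_k (a^k / k!) d^k/dx^k to a polynomial.

    The sum stops once the repeated derivative is the zero polynomial.
    """
    result = [Fraction(0)] * len(coeffs)
    term = list(coeffs)
    k = 0
    while term:
        weight = Fraction(a) ** k / factorial(k)
        for i, c in enumerate(term):
            result[i] += weight * c
        term = derivative(term)
        k += 1
    return result


f = [1, 0, -3, 1]          # f(x) = 1 - 3x^2 + x^3 (illustrative)
shifted = shift(f, 2)      # coefficients of f(x + 2)

print([int(c) for c in shifted])
```

Using `Fraction` keeps the factors $a^k/k!$ exact, so the comparison with $f(x + a)$ involves no rounding.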

Using the exponential map we can set up a coordinate system in the neighborhood
of a point $p$ of $M$ as follows. Let $X_1, \dots, X_n$ be a basis of $T_pM$, and let $U$
be a neighborhood of the origin in $\mathbb{R}^n$. Define a smooth map $f : U \subset \mathbb{R}^n \to M$ by

$$f(u^1, \dots, u^n) = \operatorname{Exp}(u^1X_1 + \cdots + u^nX_n), \tag{D.7}$$

and define coordinate functions $x^i$ on $M$ by $x^i(f(u^1, \dots, u^n)) = u^i$. These coordinates
are called geodesic coordinates or Riemann normal coordinates in the
neighborhood of $p = f(0, \dots, 0)$.² With this out of the way, we turn to the main
theorem.

1 The difference is that $e^{X_p}$ is a differential operator whereas $\operatorname{Exp} X_p$ is a point of $M$.

Theorem D.1 Near $p$ we can always choose coordinates such that the metric
tensor components satisfy

$$g_{ij}(p) = \delta_{ij} \quad\text{and}\quad g_{ij,k}(p) = 0. \tag{D.8}$$

In particular, we can always choose coordinates such that the Christoffel symbols
vanish at $p$.

Remark Coordinates in which the metric satisfies (D.8) are sometimes called
locally flat coordinates. This terminology can be very confusing, though, because,
as observed in Section 8.5, the term “locally flat” often refers to a manifold whose
Riemann tensor vanishes everywhere, which we do not assume here. Perhaps a
better term would therefore be locally Euclidean coordinates, although this does
not appear to be much of an improvement.

When the signature is Lorentzian instead of Euclidean, the theorem holds with
$\delta_{ij}$ replaced by $\eta_{ij}$ (the Minkowski metric tensor components), and the coordinates
are called local Lorentz coordinates or local inertial coordinates. The latter terminology
comes from the fact that, in special relativity, constant-velocity observers
are inertial observers - for them, Newton’s second law holds. In general relativity,
Theorem D.1 is one form of the equivalence principle, which says that, in a small
enough region, the effects of gravity (as embodied in the Christoffel symbols) can
be transformed away.

EXERCISE D.1 Show that, in a coordinate system satisfying the conditions of
Theorem D.1, the components of the Riemann tensor can be written

$$R_{ijk\ell} = \tfrac12\,(g_{i\ell,jk} - g_{j\ell,ik} - g_{ik,j\ell} + g_{jk,i\ell}). \tag{D.9}$$


We offer two proofs of Theorem D.l. The first uses Riemann normal coordinates 
while the second is a less rigorous but nonetheless insightful “counting” proof. 

Proof 1 Fix a constant vector $a$ in $\mathbb{R}^n$ and consider the (radial) line in $\mathbb{R}^n$ given
by $u^i(t) = a^i t$. The image of this line under $f$ is the geodesic in $M$ given (cf.
(D.2)) by

$$\gamma(t) = \operatorname{Exp} t a^i X_i, \tag{D.10}$$

whose tangent vector at $t = 0$ is $a^i X_i$. In terms of our local coordinates this is
the same thing as saying that the curve on $M$ given by $x^i(\gamma(t)) = u^i = a^i t$ is
a geodesic. Plugging this into the geodesic equation (8.65) gives $\Gamma^k{}_{ij}a^i a^j = 0$.
Choosing $n$ linearly independent vectors $a_1, \dots, a_n$ allows us to conclude that
$\Gamma^k{}_{ij} = 0$ at $p$.³ From (8.39) we obtain

$$g_{ij,k} = \Gamma_{jki} + \Gamma_{ikj}, \tag{D.11}$$

so $g_{ij,k} = 0$ at $p$, as advertised.

2 Generally these coordinates only exist in a small neighborhood of $p$, because the curvature of the manifold
will typically cause the geodesic lines to cross at some point $q$ different from $p$. For more about this see [11],
Section VII.6.

If $g_{ij} := g(X_i, X_j) \neq \delta_{ij}$ we can always choose a new basis $\{X_{i'}\}$ for the tangent
space such that $g_{i'j'} := g(X_{i'}, X_{j'}) = \delta_{i'j'}$ at $p$. If we define $X_{i'} := X_j\, b^j{}_{i'}$ for some
nonsingular constant matrix $b$ then

$$g_{i'j'} = g(X_{i'}, X_{j'}) = b^k{}_{i'}\, b^l{}_{j'}\, g_{kl}. \tag{D.12}$$

By the same argument as that given in the second proof of Theorem 1.5, we
can choose $b$ such that $g_{i'j'} = \delta_{i'j'}$. Moreover, because $b$ is constant, $g_{i'j',k} =
b^m{}_{i'}\, b^n{}_{j'}\, g_{mn,k} = 0$. □


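Proof 2 Because the metric is a tensor, its components must transform according
to (3.53):

$$g_{i'j'}(y) = g_{ij}(x(y))\,\frac{\partial x^i}{\partial y^{i'}}\frac{\partial x^j}{\partial y^{j'}}. \tag{D.13}$$

A general coordinate transformation can be Taylor expanded about $p$:

$$\begin{aligned}
x^i(y) = x^i(y_p) &+ \left(\frac{\partial x^i}{\partial y^{j'}}\right)_p (y^{j'} - y^{j'}_p)
 + \frac{1}{2}\left(\frac{\partial^2 x^i}{\partial y^{j'}\partial y^{k'}}\right)_p (y^{j'} - y^{j'}_p)(y^{k'} - y^{k'}_p) \\
 &+ \frac{1}{3!}\left(\frac{\partial^3 x^i}{\partial y^{j'}\partial y^{k'}\partial y^{\ell'}}\right)_p (y^{j'} - y^{j'}_p)(y^{k'} - y^{k'}_p)(y^{\ell'} - y^{\ell'}_p) + \cdots.
\end{aligned}$$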

At $p$ there are $n^2$ freely adjustable numbers

$$\left(\frac{\partial x^i}{\partial y^{j'}}\right)_p,$$

which we can use in (D.13) to put the $\binom{n+1}{2}$ metric components into Euclidean form.
The remaining $n(n - 1)/2$ parameters are exactly the number of parameters in
the orthogonal group $O(n)$, every element of which leaves the Euclidean metric
invariant.

3 Technically, we can only conclude that $\Gamma^k{}_{(ij)} = 0$, where $A_{(ij)} := \tfrac12(A_{ij} + A_{ji})$, but as the Christoffel symbol
is symmetric in its lower indices we need not worry about the symmetrization issue.

Differentiating both sides of (D.13) yields $n\binom{n+1}{2}$ equations for the first derivatives
of the metric tensor components. Those equations involve second derivatives
of the $x^i$ of the form

$$\left(\frac{\partial^2 x^i}{\partial y^{j'}\partial y^{k'}}\right)_p.$$

As there are precisely $n\binom{n+1}{2}$ freely specifiable second derivative terms of this form,
we can choose them in such a way that all the first derivatives of the metric vanish. □


EXERCISE D.2 Show that there are $a := n\binom{n+2}{3}$ choices for the numbers

$$\left(\frac{\partial^3 x^i}{\partial y^{j'}\partial y^{k'}\partial y^{\ell'}}\right)_p,$$

but $b := \binom{n+1}{2}^2$ second derivatives of the metric. Interpret the significance of $b - a$
in the light of (8.51).
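The parameter counts used in the second proof and in Exercise D.2 are simple to tabulate. The Python sketch below does so with `math.comb`; the function name `count_parameters` is ours, and the comparison of $b - a$ with $n^2(n^2 - 1)/12$, the standard number of independent components of the Riemann tensor, is offered on the assumption that this is the count that (8.51) refers to.

```python
from math import comb


def count_parameters(n):
    """Counting data for the 'counting' proof in dimension n."""
    return {
        # n^2 first derivatives minus the independent metric components
        "leftover_first_order": n * n - comb(n + 1, 2),
        # dimension of the orthogonal group O(n)
        "dim_orthogonal_group": n * (n - 1) // 2,
        # second derivatives of x^i (symmetric in the two lower indices)
        "second_derivatives": n * comb(n + 1, 2),
        # first derivatives g_{ij,k} of the metric
        "metric_first_derivatives": comb(n + 1, 2) * n,
        # a and b from Exercise D.2
        "a": n * comb(n + 2, 3),
        "b": comb(n + 1, 2) ** 2,
    }


for n in range(1, 6):
    c = count_parameters(n)
    print(n, c["leftover_first_order"], c["b"] - c["a"], n * n * (n * n - 1) // 12)
```

In dimension 4, for example, the table gives $b - a = 100 - 80 = 20$.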


Appendix E 

Holonomy of an infinitesimal loop 


In this appendix we derive the precise form of (8.87). Pick local coordinates $x^1, \dots, x^n$ in a neighborhood of $p$ on $M$, and let $\alpha$ be a small number. Fix two indices $r$ and $s$. Holding all the coordinates constant except $x^r$ and $x^s$, choose four points $p_1$ to $p_4$ as follows: $(x^r(p_1), x^s(p_1)) = (0, 0)$; $(x^r(p_2), x^s(p_2)) = (\alpha, 0)$; $(x^r(p_3), x^s(p_3)) = (\alpha, \alpha)$; and $(x^r(p_4), x^s(p_4)) = (0, \alpha)$. Let $\gamma_a(t)$ be the path following the coordinate curves from $p_a$ to $p_{a+1}$ with $p_5 = p_1$, and set $\gamma = \gamma_1\gamma_2\gamma_3\gamma_4$. (See Figure E.1.)

Recall that $A^k{}_j = \Gamma^k{}_{ij}(d\gamma^i/dt) = \Gamma^k{}_{ij}(dx^i/dt)$. To simplify the notation, define a matrix $\Gamma_i$ such that $(\Gamma_i)^k{}_j = \Gamma^k{}_{ij}$. Then $A = \Gamma_i (dx^i/dt)$, so that $A\,dt = \Gamma_i\,dx^i$. Let us write $A^{(a)}$ to denote $A$ restricted to path $\gamma_a$. Thus, for example, $A^{(1)}\,dt = \Gamma_r\,dx^r$ (no sum on $r$), because $\gamma_1$ is just the $r$th coordinate curve. Hence, using a Taylor expansion about $p_1$ (and setting $x^k = 0$ for all $k$ except $k = r, s$),

\[
\begin{aligned}
\int_0^\alpha A^{(1)}\,dt &= \int_{(0,0)}^{(\alpha,0)} \Gamma_r\,dx^r \\
 &\approx \int_{(0,0)}^{(\alpha,0)} \left( \Gamma_r(0,0) + \frac{\partial \Gamma_r}{\partial x^r}(0,0)\,x^r \right) dx^r \\
 &= \alpha\,\Gamma_r(0,0) + \frac{\alpha^2}{2}\frac{\partial \Gamma_r}{\partial x^r}(0,0),
\end{aligned}
\]

where we have retained only terms of order $\alpha^2$ or smaller. The argument for $A^{(2)}$ is a little different, because the base point is shifted, so we must use Taylor's theorem again:


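\[
\begin{aligned}
\int_0^\alpha A^{(2)}\,dt &= \int_{(\alpha,0)}^{(\alpha,\alpha)} \Gamma_s\,dx^s \\
 &\approx \int_{(\alpha,0)}^{(\alpha,\alpha)} \left( \Gamma_s(\alpha,0) + \frac{\partial \Gamma_s}{\partial x^s}(\alpha,0)\,x^s \right) dx^s \\
 &\approx \int_{(\alpha,0)}^{(\alpha,\alpha)} \left( \Gamma_s(0,0) + \alpha\frac{\partial \Gamma_s}{\partial x^r}(0,0) + \frac{\partial \Gamma_s}{\partial x^s}(0,0)\,x^s \right) dx^s \\
 &= \alpha\,\Gamma_s(0,0) + \alpha^2\frac{\partial \Gamma_s}{\partial x^r}(0,0) + \frac{\alpha^2}{2}\frac{\partial \Gamma_s}{\partial x^s}(0,0).
\end{aligned}
\]

Figure E.1 A small closed curve around a coordinate patch.

Also,

\[
\begin{aligned}
\int_0^\alpha A^{(3)}\,dt &= \int_{(\alpha,\alpha)}^{(0,\alpha)} \Gamma_r\,dx^r \\
 &\approx \int_{(\alpha,\alpha)}^{(0,\alpha)} \left( \Gamma_r(\alpha,\alpha) + \frac{\partial \Gamma_r}{\partial x^r}(\alpha,\alpha)\,(x^r - \alpha) \right) dx^r \\
 &\approx \int_{(\alpha,\alpha)}^{(0,\alpha)} \left( \Gamma_r(0,0) + \alpha\frac{\partial \Gamma_r}{\partial x^r}(0,0) + \alpha\frac{\partial \Gamma_r}{\partial x^s}(0,0) + \frac{\partial \Gamma_r}{\partial x^r}(0,0)\,(x^r - \alpha) \right) dx^r \\
 &= -\alpha\,\Gamma_r(0,0) - \alpha^2\frac{\partial \Gamma_r}{\partial x^s}(0,0) - \frac{\alpha^2}{2}\frac{\partial \Gamma_r}{\partial x^r}(0,0).
\end{aligned}
\]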


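Lastly,

\[
\begin{aligned}
\int_0^\alpha A^{(4)}\,dt &= \int_{(0,\alpha)}^{(0,0)} \Gamma_s\,dx^s \\
 &\approx \int_{(0,\alpha)}^{(0,0)} \left( \Gamma_s(0,\alpha) + \frac{\partial \Gamma_s}{\partial x^s}(0,\alpha)\,(x^s - \alpha) \right) dx^s \\
 &\approx \int_{(0,\alpha)}^{(0,0)} \left( \Gamma_s(0,0) + \alpha\frac{\partial \Gamma_s}{\partial x^s}(0,0) + \frac{\partial \Gamma_s}{\partial x^s}(0,0)\,(x^s - \alpha) \right) dx^s \\
 &= -\alpha\,\Gamma_s(0,0) - \frac{\alpha^2}{2}\frac{\partial \Gamma_s}{\partial x^s}(0,0).
\end{aligned}
\]

Keeping only terms up to order $\alpha^2$ (and dropping $(0, 0)$ from the notation), we get, for example,

\[
\begin{aligned}
\vartheta(\gamma_1) &\approx 1 - \int_0^\alpha dt_1\,A^{(1)}(t_1) + \int_0^\alpha dt_1 \int_0^{t_1} dt_2\,A^{(1)}(t_1)A^{(1)}(t_2) \\
 &= 1 - \alpha\,\Gamma_r - \frac{\alpha^2}{2}\frac{\partial \Gamma_r}{\partial x^r} + \frac{\alpha^2}{2}\Gamma_r^2.
\end{aligned}
\]

(For the last term we need to retain only the constant term in the Taylor expansion, because this term is already of order $\alpha^2$.) Analogous formulae hold for all the other segments. Multiplying them all together according to (8.84) gives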


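\[
\begin{aligned}
\vartheta(\gamma) ={}& \left( 1 + \alpha\,\Gamma_s + \frac{\alpha^2}{2}\frac{\partial \Gamma_s}{\partial x^s} + \frac{\alpha^2}{2}\Gamma_s^2 \right) \\
 &\times \left( 1 + \alpha\,\Gamma_r + \alpha^2\frac{\partial \Gamma_r}{\partial x^s} + \frac{\alpha^2}{2}\frac{\partial \Gamma_r}{\partial x^r} + \frac{\alpha^2}{2}\Gamma_r^2 \right) \\
 &\times \left( 1 - \alpha\,\Gamma_s - \alpha^2\frac{\partial \Gamma_s}{\partial x^r} - \frac{\alpha^2}{2}\frac{\partial \Gamma_s}{\partial x^s} + \frac{\alpha^2}{2}\Gamma_s^2 \right) \\
 &\times \left( 1 - \alpha\,\Gamma_r - \frac{\alpha^2}{2}\frac{\partial \Gamma_r}{\partial x^r} + \frac{\alpha^2}{2}\Gamma_r^2 \right) \\
 ={}& 1 + \alpha^2\,\mathcal{R}_{rs},
\end{aligned}
\]

where

\[ \mathcal{R}_{rs} := \frac{\partial \Gamma_r}{\partial x^s} - \frac{\partial \Gamma_s}{\partial x^r} + \Gamma_s\Gamma_r - \Gamma_r\Gamma_s \tag{E.1} \]

is a matrix whose entries are precisely the components of the Riemann curvature tensor (cf. (8.46)):

\[ (\mathcal{R}_{rs})^i{}_j = \Gamma^i{}_{rj,s} - \Gamma^i{}_{sj,r} + \Gamma^i{}_{sk}\Gamma^k{}_{rj} - \Gamma^i{}_{rk}\Gamma^k{}_{sj} = R^i{}_{jsr}. \tag{E.2} \]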

Appendix F 

Frobenius’ theorem 


Frobenius' theorem is a cornerstone of modern differential geometry. It appears in 
many different guises and is applied in many different circumstances. At its base it 
has to do with the question of whether certain systems of partial differential equa- 
tions can be solved (i.e., integrated), but it has a much more elegant and versatile 
formulation in terms of vector fields (or forms) on manifolds. 


F.1 Integrable distributions

To motivate the discussion, we start with a smooth vector field $X$ on a manifold $M$. By the standard theory of ordinary differential equations, given an initial point we can find an integral curve of $X$ through that point. This integral curve is a one-dimensional submanifold of $M$. We want to generalize this to higher dimensions, so first we need a higher-dimensional analogue of a vector field.

A $k$-dimensional distribution $\Delta$ on a manifold $M$ is a smooth assignment of a $k$-dimensional subspace $\Delta_p$ of $T_pM$ to each point $p \in M$. 1 The smoothness condition is equivalent to saying that, at every point $q$ in some neighborhood of $p$, $\Delta_q$ is generated by $k$ linearly independent vector fields:

\[ \Delta_q = \operatorname{span}\{X_1(q), \dots, X_k(q)\}. \tag{F.1} \]

The distribution $\Delta$ is integrable if for each $p \in M$ there exists a submanifold $\Sigma$ containing $p$ such that $T_q\Sigma = \Delta_q$ for all $q \in \Sigma$. In other words, the tangent spaces to $\Sigma$ all lie in the distribution $\Delta$. The submanifold $\Sigma$ is called an integral submanifold of $\Delta$. (See Figure F.1.)

We have seen that integral curves exist, so every one-dimensional distribution 
is integrable. The same is not true of higher-dimensional distributions. To see this, 

1 In fancy terminology, a distribution is a subbundle of the tangent bundle $TM$.

Figure F.1 (a) A distribution and (b) its integral submanifold.


we will obtain a necessary condition for a distribution to be integrable. We say that $\Delta$ is involutive if, for all $p \in M$ and $1 \le i, j \le k$,

\[ [X_i(q), X_j(q)] \in \Delta_q \tag{F.2} \]

for all $q$ in a neighborhood of $p$. This condition is often written symbolically as

\[ [\Delta, \Delta] \subseteq \Delta, \tag{F.3} \]

because if (F.2) is satisfied then the Lie bracket of any element of $\Delta$ with any other element of $\Delta$ must again lie in $\Delta$ (and vice versa).

If $\Delta$ is integrable then it is clearly involutive. This follows because if $\Sigma$ is an integral submanifold of $\Delta$ then the Lie bracket of any vector fields in the tangent space to $\Sigma$ is again in the tangent space to $\Sigma$ (cf. Exercise 3.20). Surprisingly, the converse is also true. This is the content of Frobenius' theorem.

Before stating the theorem, let's see what can go wrong if $\Delta$ is not involutive. Consider the two-dimensional distribution $\Delta$ on $\mathbb{R}^3$ spanned by the vector fields

\[ X = \frac{\partial}{\partial x} + z\frac{\partial}{\partial y}, \qquad Y = \frac{\partial}{\partial z}. \tag{F.4} \]

Note that $[X, Y] = -\partial/\partial y \notin \Delta$. Suppose that $\Delta$ is integrable, and let $\Sigma$ be the integral surface through the origin. The integral curves of $X$ (which must lie in $\Sigma$) are obtained from the equations (cf. (3.95))

\[ \frac{dx}{dt} = 1, \qquad \frac{dy}{dt} = z(t), \qquad \text{and} \qquad \frac{dz}{dt} = 0, \tag{F.5} \]

whose solution is $(x(t), y(t), z(t)) = (t + a, ct + b, c)$ for some constants $a$, $b$, and $c$. The surface $\Sigma$ goes through the origin, so $a = b = c = 0$; this means that $\Sigma$ contains an open subset of the $x$ axis. But the integral curves of $Y$ all contain


2 Frobenius' theorem is not actually due to Frobenius, but rather to A. Clebsch and F. Deahna. Unfortunately, this is how it is remembered by most people nowadays. See e.g. [49].

a segment of the $z$ axis, so $\Sigma$ must contain a small area of the $xz$ plane. This is a contradiction, however, because if $z \neq 0$ then the vector $X$ is not tangent to the $xz$ plane (it contains a nonzero piece proportional to $\partial/\partial y$).
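The bracket in this example can be confirmed numerically. The following is a small sketch, assuming the fields $X = \partial/\partial x + z\,\partial/\partial y$ and $Y = \partial/\partial z$; the helpers `lie_bracket` and `det3` are illustrative names, and the bracket is computed from the coordinate formula $[X, Y]^i = X^j \partial_j Y^i - Y^j \partial_j X^i$ with central differences.

```python
def lie_bracket(X, Y, point, h=1e-5):
    """Components of [X, Y] at `point`: X^j d_j Y^i - Y^j d_j X^i."""
    n = len(point)

    def derivative_along(V, W):
        # Directional derivative of the components of W along V at `point`.
        v = V(point)
        out = [0.0] * n
        for j in range(n):
            plus, minus = list(point), list(point)
            plus[j] += h
            minus[j] -= h
            w_plus, w_minus = W(plus), W(minus)
            for i in range(n):
                out[i] += v[j] * (w_plus[i] - w_minus[i]) / (2 * h)
        return out

    xy = derivative_along(X, Y)
    yx = derivative_along(Y, X)
    return [xy[i] - yx[i] for i in range(n)]

def det3(a, b, c):
    """Determinant of the 3x3 matrix with rows a, b, c."""
    return (a[0] * (b[1] * c[2] - b[2] * c[1])
            - a[1] * (b[0] * c[2] - b[2] * c[0])
            + a[2] * (b[0] * c[1] - b[1] * c[0]))

def X(p):
    x, y, z = p
    return [1.0, z, 0.0]      # d/dx + z d/dy

def Y(p):
    return [0.0, 0.0, 1.0]    # d/dz

point = [0.3, -1.2, 0.7]
bracket = lie_bracket(X, Y, point)
# If the bracket lay in span{X, Y}, these three vectors would be dependent.
volume = det3(X(point), Y(point), bracket)
print("[X, Y] =", bracket)
print("det(X, Y, [X, Y]) =", volume)
```

A nonzero determinant means that $X$, $Y$, and $[X, Y]$ are linearly independent at the sample point, so the bracket leaves the distribution there.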

With this example out of the way, we can now turn to a proof of the theorem itself. Actually, we will prove something a bit stronger. By definition, a $k$-dimensional distribution $\Delta$ is completely integrable if it is integrable and for every $p \in M$ there is a neighborhood $U$ of $p$ with local coordinates $x^1, \dots, x^n$ such that, everywhere on $U$,

\[ \Delta = \operatorname{span}\left\{ \frac{\partial}{\partial x^1}, \dots, \frac{\partial}{\partial x^k} \right\}. \tag{F.6} \]

If $\Delta$ is completely integrable then, in the neighborhood $U$, the integral submanifold $\Sigma$ looks like $x^{k+1} = c_{k+1}$, $x^{k+2} = c_{k+2}$, $\dots$, $x^n = c_n$ for some constants $c_{k+1}, \dots, c_n$. We say that $\Sigma$ looks locally straight (because we have "straightened out" the integral submanifold so that it can be made to look like $\mathbb{R}^k$ in a neighborhood). 3

Theorem F.1 (Clebsch-Deahna-Frobenius) The distribution $\Delta$ is completely integrable if and only if it is involutive.

Proof We follow the proof given in [53]. We have already shown one direction, so we may assume that $\Delta$ is involutive. First we show that we can find a coordinate system in which $\Sigma$ is almost straight and then we show that we can straighten it out completely. □

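Lemma F.2 There is a neighborhood $V$ of $p$ with coordinates $y^1, \dots, y^n$ such that $\Delta$ is spanned by vector fields of the form

\[ X_i = \frac{\partial}{\partial y^i} + \sum_{j=k+1}^{n} b_{ij}\frac{\partial}{\partial y^j}, \qquad 1 \le i \le k, \tag{F.7} \]

for some $k \times n$ matrix $(b_{ij})$ of smooth functions.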

Proof Choose local coordinates $y^1, \dots, y^n$ in a neighborhood $V'$ of $p$ and suppose that $\Delta = \operatorname{span}\{Y_1, \dots, Y_k\}$ on $V'$. Then

\[ Y_i = \sum_{j=1}^{n} a_{ij}\frac{\partial}{\partial y^j}, \qquad 1 \le i \le k, \]

for some $k \times n$ matrix $(a_{ij})$ of smooth functions on $V'$. The $Y_i$ are linearly independent, so $(a_{ij})$ has rank $k$. We may therefore assume that, in a (possibly smaller)


3 Do not confuse "straight" with "flat". A flat manifold has locally straight neighborhoods, but not necessarily vice versa.

neighborhood $V \subseteq V'$ of $p$, the $k \times k$ submatrix $A' = (a_{ij})$ with $1 \le i, j \le k$ is nonsingular. Now set $X_i := \sum_{j=1}^{k} (A'^{-1})_{ij} Y_j$. □

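Corollary F.3 If the $k$-dimensional distribution $\Delta$ is involutive then the basis (F.7) satisfies $[X_i, X_j] = 0$ for $1 \le i, j \le k$.

Proof By computing brackets, we note that

\[ [X_i, X_j] \in \operatorname{span}\left\{ \frac{\partial}{\partial y^{k+1}}, \dots, \frac{\partial}{\partial y^n} \right\}, \qquad 1 \le i, j \le k. \]

But, by the involutivity of $\Delta$,

\[ [X_i, X_j] \in \operatorname{span}\{X_1, \dots, X_k\}. \]

These two vector spaces meet only in the zero vector (since every nonzero vector in $\operatorname{span}\{X_1, \dots, X_k\}$ has a nonzero component along $\partial/\partial y^i$ for some $1 \le i \le k$). □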

It only remains to show that if $\{X_1, \dots, X_k\}$ are pairwise commuting linearly independent vector fields on a neighborhood $U$ of a point $p$ then there exist coordinates $x^1, \dots, x^n$ on $U$ such that $X_i = \partial/\partial x^i$ for $1 \le i \le k$. When $k = n$ this is just the content of Theorem 8.2 and, when $k < n$, a similar but more involved argument can be made. Instead, we present a simple proof by induction on $k$ that uses only the fundamental theorem of ordinary differential equations. The base case $k = 1$ is dealt with by the next lemma.

Lemma F.4 Let $X$ be a vector field nonvanishing at $p$. Then in a neighborhood of $p$ we can choose coordinates such that $X = \partial/\partial x^1$.

Proof The basic idea is simple enough. Choose local coordinates $y^1, \dots, y^n$ in a neighborhood $U$ of $p$, and consider the integral curves of $X$ through $q \in U$. Suppose the curves look schematically like those in Figure F.2, where the horizontal axis is $y^1$ and the vertical axis represents all the other coordinates. We then make a coordinate transformation

\[ (y^1, \dots, y^n) \to (x^1, y^2, \dots, y^n), \tag{F.8} \]

where $x^1$ is the parameter along the integral curves. The only technical part of the proof lies in showing that this is a good coordinate transformation everywhere on the neighborhood $U$. For this, we refer the reader to ([88], Proposition 1.53). □

Figure F.2 Coordinates based on integral curves.

We can now complete the proof of Frobenius' theorem. By Lemma F.4 the result is true for $k = 1$, so assume it holds for $k - 1$. By Corollary F.3 we may assume that $\Delta = \operatorname{span}\{X_1, \dots, X_k\}$ with $[X_i, X_j] = 0$. By the inductive hypothesis, there exist coordinates $y^1, \dots, y^n$ such that

\[ \operatorname{span}\{X_1, \dots, X_{k-1}\} = \operatorname{span}\left\{ \frac{\partial}{\partial y^1}, \dots, \frac{\partial}{\partial y^{k-1}} \right\}. \tag{F.9} \]

In other words $\partial/\partial y^i = c_{ij}X_j$ for some nonsingular matrix $(c_{ij})$ with $1 \le i, j \le k - 1$. Hence, by (F.9) again,

\[ \left[ \frac{\partial}{\partial y^i}, X_k \right] = [c_{ij}X_j, X_k] = -X_k(c_{ij})X_j \in \operatorname{span}\left\{ \frac{\partial}{\partial y^1}, \dots, \frac{\partial}{\partial y^{k-1}} \right\}. \tag{F.10} \]

Writing $X_k = \sum_{j=1}^{n} a^j\,\partial/\partial y^j$ and using (F.10) gives

\[ \frac{\partial a^j}{\partial y^i} = 0 \quad \text{for } 1 \le i \le k - 1 \text{ and } k \le j \le n. \tag{F.11} \]

Now define

\[ Y := X_k - \sum_{j=1}^{k-1} a^j\frac{\partial}{\partial y^j} = \sum_{j=k}^{n} a^j\frac{\partial}{\partial y^j}. \tag{F.12} \]

Then

\[ \operatorname{span}\{X_1, \dots, X_k\} = \operatorname{span}\left\{ \frac{\partial}{\partial y^1}, \dots, \frac{\partial}{\partial y^{k-1}}, Y \right\}. \tag{F.13} \]

Because $Y$ only depends on $y^k, \dots, y^n$, Lemma F.4 guarantees that we can choose new coordinates $x^\ell = x^\ell(y^k, \dots, y^n)$, $k \le \ell \le n$, such that $Y = \partial/\partial x^k$. Lastly, set $x^\ell = y^\ell$ for $1 \le \ell \le k - 1$. Then

\[ \Delta = \operatorname{span}\left\{ \frac{\partial}{\partial x^1}, \dots, \frac{\partial}{\partial x^k} \right\}, \tag{F.14} \]

as desired. □

Remark To preempt possible confusion, it is worth observing that Frobenius' theorem says something a bit more general but a bit less strong than Theorem 8.2. Given an involutive set of linearly independent vector fields $X_1, \dots, X_n$, Frobenius' theorem says that we can find local coordinates $x^1, \dots, x^n$ in which the span of these vector fields coincides with the span of $\partial/\partial x^1, \dots, \partial/\partial x^n$. It does not say that we can find local coordinates such that $X_i = \partial/\partial x^i$ for all $i$. That is only true if the vector fields satisfy the stronger condition of mutual commutativity, namely $[X_i, X_j] = 0$ for all $i, j$.

Recall that in Section 3.16 we discussed how vector fields yield integral curves: 
given a vector field X on a manifold M , through each point of M there is a unique 
integral curve of X. The fact that these integral curves are unique means that they 
do not cross anywhere. Hence the collection of all these curves fills up M (cf. 
Figure 3.13). Similarly, if we have an involutive distribution, Frobenius’ theorem 
guarantees that there is a unique integral submanifold through every point of M, 
so that these submanifolds fill up M. One says that the integral submanifolds are 
the leaves of a foliation of $M$. The arboreal metaphor is intended to suggest a tree (the manifold) filled by its leaves (the integral submanifolds). A better metaphor would be that of an onion, whose various layers are the integral submanifolds. 4 5 In any event, we have the following equivalent formulation of Frobenius' theorem.

Theorem F.5 If $\Delta$ is an involutive distribution on $M$ then the integral submanifolds of $\Delta$ foliate $M$.


F.2 Pfaffian systems

Frobenius' theorem can also be rephrased in the language of differential forms. Let $\Theta := \{\theta^1, \dots, \theta^m\}$ be a linearly independent set of 1-forms on a neighborhood $U$ of $p \in M$ (so that $\theta^1 \wedge \cdots \wedge \theta^m \neq 0$ on $U$). The set $\Theta$ is called a Pfaffian system of 1-forms. The zero set or kernel of $\Theta$ is the set of all vector fields $X$ annihilated by $\Theta$: $\{X : \theta^i(X) = 0,\ 1 \le i \le m\}$. Each condition $\theta^i(X) = 0$ imposes one constraint on $X$, so the zero set of $\Theta$ is an $(n - m)$-dimensional distribution. Thus, statements about distributions can be translated into statements about forms. 6

Theorem F.6 Let $\Theta := \{\theta^1, \dots, \theta^m\}$ be a Pfaffian system whose kernel coincides (locally) with the distribution $\Delta$. Then the following are equivalent.

(1) The distribution $\Delta$ is involutive.

(2) We have $d\theta^i \wedge (\theta^1 \wedge \cdots \wedge \theta^m) = 0$ for $i = 1, \dots, m$.

4 For example, the set of all 2-spheres of different radii foliate $\mathbb{R}^3 \setminus \{0\}$.

5 A foliation has a technical definition but, as we do not need it here, we refer the reader instead to, e.g., [14],

6 The form versions are due to Frobenius, which is why his name is attached to the theorem.

(3) We have $d\theta^i = 0 \bmod \theta^j$ for $1 \le i, j \le m$ (i.e., $d\theta^i = \sum_j A^i{}_j \wedge \theta^j$ for some 1-forms $A^i{}_j$). We say that $\Theta$ is a differential ideal.

Any of these conditions is sufficient to guarantee that $\Delta$ is completely integrable, but sometimes the form versions are more convenient.


EXERCISE F.1 Prove Theorem F.6. Hint: You may wish to use (3.123).


The language of forms allows us to restate Frobenius' theorem in a very useful way. We begin by clarifying why the submanifolds in that theorem are called "integral". The basic idea is that the Pfaffian system $\Theta$ can be viewed as a system of partial differential equations whose solution sets (with varying initial conditions) are precisely the integral submanifolds guaranteed to exist by Frobenius. This, in turn, ensures that $\Theta$ can be expressed in a particularly convenient manner.

As a very simple example, consider the form $\theta = (2xy + z^2)\,dx + x^2\,dy + 2xz\,dz$ on $\mathbb{R}^3$. It is easy to check that $d\theta = 0$, so $\theta$ is a trivial differential ideal. Hence, by Frobenius' theorem, $\theta$ is completely integrable. But what is the integral submanifold? A hint is provided by the Poincaré lemma, which tells us that $\theta$ is exact so that there is a function $f$ such that $\theta = df$. This is equivalent to a set of differential equations for $f$,

\[ \frac{\partial f}{\partial x} = 2xy + z^2, \qquad \frac{\partial f}{\partial y} = x^2, \qquad \frac{\partial f}{\partial z} = 2xz, \]

whose simultaneous solution is $f = x^2y + xz^2$. The equation $\theta = 0$ just says that $df = 0$, or $f = \text{constant}$. We have therefore found our integral submanifolds, namely the surfaces $x^2y + xz^2 = \text{constant}$. The tangent spaces of these integral submanifolds constitute the distribution determined by $\theta$, as they are spanned by vectors that are all annihilated by $\theta$. Moreover, Frobenius' theorem also says that we can find new coordinates $x'$, $y'$, $z'$ such that the integral submanifold looks locally like $z' = \text{constant}$. (In this example we can choose coordinates $x' = x$, $y' = y$, and $z' = f$ whenever $xz \neq 0$, which is where $\partial f/\partial z = 2xz$ is nonzero.)
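The claims $d\theta = 0$ and $\theta = df$ in this example can be checked at a few sample points. The sketch below is illustrative only; the helper names `gradient` and `exterior_derivative` and the sample points are arbitrary choices, and derivatives are approximated by central differences.

```python
def theta(p):
    """Components (theta_x, theta_y, theta_z) of the 1-form in the example."""
    x, y, z = p
    return [2 * x * y + z ** 2, x ** 2, 2 * x * z]

def f(p):
    """Candidate potential f = x^2 y + x z^2."""
    x, y, z = p
    return x ** 2 * y + x * z ** 2

def gradient(func, p, h=1e-5):
    grad = []
    for j in range(3):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        grad.append((func(plus) - func(minus)) / (2 * h))
    return grad

def exterior_derivative(form, p, h=1e-5):
    """Components d_j theta_k - d_k theta_j for (j, k) = (0,1), (0,2), (1,2)."""
    def partial(j, k):
        plus, minus = list(p), list(p)
        plus[j] += h
        minus[j] -= h
        return (form(plus)[k] - form(minus)[k]) / (2 * h)
    return [partial(j, k) - partial(k, j) for (j, k) in [(0, 1), (0, 2), (1, 2)]]

points = [[0.7, -1.3, 0.4], [-2.0, 0.5, 1.5], [1.1, 2.2, -0.9]]
grad_error = max(abs(g - t)
                 for p in points
                 for g, t in zip(gradient(f, p), theta(p)))
closed_error = max(abs(c)
                   for p in points
                   for c in exterior_derivative(theta, p))
print("max |df - theta|:", grad_error)
print("max |d theta|:", closed_error)
```

Both numbers should be at the level of finite-difference noise, consistent with $\theta$ being closed and equal to $df$.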

Of course, most 1-forms $\theta$ will not satisfy $d\theta = 0$, so in general there is no $f$ such that $\theta = df$. However, as long as $d\theta$ is proportional to $\theta$, the claim is that there are now two functions, $f$ and $g$, such that $\theta = g\,df$. If $\theta$ is not identically zero at $p$ then $g$ cannot vanish at $p$, so again $\theta = 0$ implies that the integral surface is given by $f = \text{constant}$.

This idea generalizes, and the result is sufficiently useful for us to present it as 
yet another version of Frobenius’ theorem. 


7 Observe that the condition that $\theta$ does not vanish identically at $p$ is part of the hypothesis of a Pfaffian system.

Theorem F.7 Let $\Theta := \{\theta^1, \dots, \theta^m\}$ be a Pfaffian system on some neighborhood $U$. If $\Theta$ is a differential ideal then on $U$ there exist $m^2$ functions $g^i{}_j$ and $m$ functions $f^j$ such that

\[ \theta^i = \sum_{j=1}^{m} g^i{}_j\,df^j, \qquad i = 1, \dots, m. \tag{F.15} \]

Equivalently, writing $\Theta = (\theta^i)$ (a column matrix), $G := (g^i{}_j)$, and $F = (f^j)$ (another column matrix),

\[ \Theta = G\,dF. \tag{F.16} \]

Proof By Frobenius' theorem the distribution $\Delta$ annihilated by $\Theta$ is completely integrable, with integral submanifold $\Sigma$. Hence there exist local coordinates $x^1, \dots, x^n$ such that $\Sigma$ is given by $x^{k+1} = c_{k+1}, \dots, x^n = c_n$, where $k = n - m$. Equivalently, $\Sigma$ is the solution set to the differential system $df^1 = \cdots = df^m = 0$, where $f^j := x^{k+j}$. It follows that any annihilator of $\Delta$ must be a linear combination of the 1-forms $df^j$ for $j = 1, \dots, m$. □


F.3 Holonomicity and constrained systems 

There is one more thing that needs to be addressed, namely the relationship between Frobenius' theorem and holonomicity. The story begins with classical mechanics. Consider the spherical pendulum, which consists of a bob of mass $m$ suspended from a single point by a massless rigid rod of length $a$. To specify the state of the system at any instant, we use (inverted) polar coordinates $r, \theta, \phi$, as shown in Figure F.3. In effect, the configuration space of the system is a three-dimensional manifold $M$ coordinatized by $q = (r, \theta, \phi)$. The (generalized) velocity of the bob at any instant is $\dot{q} = (\dot{r}, \dot{\theta}, \dot{\phi})$, where the overdot means differentiation with respect to time.

But not all velocities are allowed, because we are assuming that the rod is rigid. Instead, we have a constraint, namely $\dot{r} = 0$. Now, it is easy to see the effect of this constraint: the bob is constrained to move along a sphere of radius $a$ (hence the name "spherical pendulum"). Constraints of physical systems that can be written directly in terms of coordinates (such as $r = a$) or in terms of derivatives that can be integrated (such as $\dot{r} = 0$), are called holonomic constraints.

To connect this with Frobenius' theorem we must change our point of view slightly. First, we view $\dot{q}$ as a tangent vector field on $M$:

\[ X = \dot{r}\frac{\partial}{\partial r} + \dot{\theta}\frac{\partial}{\partial \theta} + \dot{\phi}\frac{\partial}{\partial \phi}. \]

Figure F.3 A spherical pendulum.

The velocity constraint can then be written $\omega(X) = 0$, where $\omega := dr$. The vector fields $X$ annihilated by $\omega$ are precisely the allowed velocity vectors, and they define a two-dimensional distribution $\Delta$ on $M$. Frobenius' theorem guarantees that this distribution is integrable, because $d\omega = 0$ ($\omega$ itself constitutes a differential ideal). Of course, we already knew that, because we can directly integrate the constraint $\omega = 0$ to get the answer $r = \text{constant}$. In particular, the integral submanifolds of $\Delta$ are spheres of different radii, and they foliate $M$ as expected. The motion of the bob is constrained to lie on one of these spheres, namely $r = a$. This is true in general: holonomic constraints lead to motions that are constrained to lie within integral submanifolds; such constraints are called "integrable" because they can be integrated (in principle).

Nonholonomic systems, however, behave quite differently. Consider a disk rolling without slipping on a plane, as shown in Figure F.4. To specify the state of the system at any instant, we need four coordinates: $x$ and $y$ tell you where the disk touches the plane, $\theta$ gives its direction relative to the $x$ axis, and $\phi$ gives the angular location of a point on the rim of the disk relative to the vertical. The configuration space of the system is a four-dimensional manifold $M$ globally coordinatized by $q := (x, y, \theta, \phi)$.

The (generalized) velocity of the disk at any instant is $\dot{q} = (\dot{x}, \dot{y}, \dot{\theta}, \dot{\phi})$, and the condition of rolling without slipping can be written

\[ \dot{x} = a\cos\theta\,\dot{\phi}, \qquad \dot{y} = a\sin\theta\,\dot{\phi}, \]

where $a$ is the radius of the disk. Again we interpret $\dot{q}$ as a vector field $X$ on $M$, so that

\[ X = \dot{x}\frac{\partial}{\partial x} + \dot{y}\frac{\partial}{\partial y} + \dot{\theta}\frac{\partial}{\partial \theta} + \dot{\phi}\frac{\partial}{\partial \phi}. \]

The constraints can then be written as $\omega_1(X) = \omega_2(X) = 0$, where

\[ \omega_1 := dx - a\cos\theta\,d\phi, \qquad \omega_2 := dy - a\sin\theta\,d\phi. \]

Thus $\omega_1$ and $\omega_2$ constitute a Pfaffian system whose zero set $\Delta$ defines a two-dimensional distribution on $M$, but the difference now is that $\Delta$ is not integrable, because, for example,

\[ d\omega_1 = a\sin\theta\,d\theta \wedge d\phi \]

fails to lie in the ideal generated by $\omega_1$ and $\omega_2$.

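It is instructive to see what all this looks like in terms of vector fields instead of forms. Denote an arbitrary vector field by

\[ X = f\frac{\partial}{\partial x} + g\frac{\partial}{\partial y} + h\frac{\partial}{\partial \theta} + k\frac{\partial}{\partial \phi}, \]

where $f$, $g$, $h$, and $k$ are arbitrary functions of $x$, $y$, $\theta$, and $\phi$. Then the two constraint conditions become conditions on the allowed vector fields:

\[
\begin{aligned}
\omega_1(X) = 0 \quad &\Rightarrow \quad X = ak\cos\theta\,\frac{\partial}{\partial x} + g\frac{\partial}{\partial y} + h\frac{\partial}{\partial \theta} + k\frac{\partial}{\partial \phi}, \\
\omega_2(X) = 0 \quad &\Rightarrow \quad X = f\frac{\partial}{\partial x} + ak\sin\theta\,\frac{\partial}{\partial y} + h\frac{\partial}{\partial \theta} + k\frac{\partial}{\partial \phi}.
\end{aligned}
\]

The two conditions must be satisfied simultaneously, so we must have

\[ X = ak\cos\theta\,\frac{\partial}{\partial x} + ak\sin\theta\,\frac{\partial}{\partial y} + h\frac{\partial}{\partial \theta} + k\frac{\partial}{\partial \phi}. \]

(We can also obtain this result directly from the velocity constraints themselves.) The following two vector fields therefore constitute a basis for $\Delta$:

\[
\begin{aligned}
X_1 &:= a\cos\theta\,\frac{\partial}{\partial x} + a\sin\theta\,\frac{\partial}{\partial y} + \frac{\partial}{\partial \phi}, \\
X_2 &:= \frac{\partial}{\partial \theta}.
\end{aligned}
\]

A simple calculation shows that

\[ [X_1, X_2] = a\sin\theta\,\frac{\partial}{\partial x} - a\cos\theta\,\frac{\partial}{\partial y} \notin \operatorname{span}\{X_1, X_2\}, \tag{F.17} \]

so again we see that $\Delta$ is not integrable.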
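The "simple calculation" behind (F.17) can be reproduced numerically. This is a sketch under stated assumptions: coordinates are ordered $(x, y, \theta, \phi)$, the radius value `A_RADIUS = 0.5` and the sample point are arbitrary, and `lie_bracket` is an illustrative helper based on central differences.

```python
import math

A_RADIUS = 0.5  # disk radius a (arbitrary illustrative value)

def X1(q):
    x, y, theta, phi = q
    return [A_RADIUS * math.cos(theta), A_RADIUS * math.sin(theta), 0.0, 1.0]

def X2(q):
    return [0.0, 0.0, 1.0, 0.0]

def lie_bracket(X, Y, q, h=1e-5):
    """Components of [X, Y] at q: X^j d_j Y^i - Y^j d_j X^i."""
    n = len(q)

    def derivative_along(V, W):
        v = V(q)
        out = [0.0] * n
        for j in range(n):
            plus, minus = list(q), list(q)
            plus[j] += h
            minus[j] -= h
            w_plus, w_minus = W(plus), W(minus)
            for i in range(n):
                out[i] += v[j] * (w_plus[i] - w_minus[i]) / (2 * h)
        return out

    xy = derivative_along(X, Y)
    yx = derivative_along(Y, X)
    return [xy[i] - yx[i] for i in range(n)]

q0 = [0.2, -0.4, 0.9, 1.7]
bracket = lie_bracket(X1, X2, q0)
expected = [A_RADIUS * math.sin(q0[2]), -A_RADIUS * math.cos(q0[2]), 0.0, 0.0]

# A vector c1*X1 + c2*X2 has phi-component c1 and theta-component c2,
# so the only candidate coefficients are read off from the bracket itself.
c1, c2 = bracket[3], bracket[2]
residual = [bracket[i] - c1 * X1(q0)[i] - c2 * X2(q0)[i] for i in range(4)]
outside_span = max(abs(r) for r in residual) > 1e-6
print("[X1, X2] =", bracket)
print("outside span{X1, X2}:", outside_span)
```

The residual test works because $X_1$ and $X_2$ are the only basis vectors with $\phi$ and $\theta$ components, respectively; the bracket has neither, yet is nonzero.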

What is the physical significance of the fact that $\Delta$ fails to be integrable? One answer is given by the theorem of Chow and Rashevskii, which says, in effect, that if one has a bunch of vector fields that are "maximally noninvolutive" then by following along the integral curves of those vector fields you can travel from any point of $M$ to any other point of $M$. More precisely, a distribution $\Delta$ is called bracket-generating if, for any point $p \in M$ and any local basis $\{X_1, \dots, X_k\}$ for $\Delta_p$, the vector space generated by the basis vectors and all their iterated brackets $[X_i, X_j]$, $[X_i, [X_j, X_k]]$, $\dots$ coincides with $T_pM$.

Theorem F.8 (Chow-Rashevskii) If $\Delta$ is a bracket-generating distribution on a connected manifold $M$ then any two points of $M$ can be connected by a path whose tangent vectors lie in $\Delta$.

Proof The idea of the proof is simple enough. If all the iterated brackets span only a proper subspace of the tangent space then they determine a lower-dimensional involutive distribution; so, by Frobenius' theorem, the motion is constrained to lie within a proper integral submanifold. For the details see [62]. □

One can check that the constraint vector fields for the disk system are bracket-generating. Therefore, by the Chow-Rashevskii theorem, legal motions of the system are not constrained to lie on any submanifold. In other words, we can roll the disk without its slipping around the plane and can visit every possible point of the configuration space. 8 The Chow-Rashevskii theorem has applications to various disciplines such as thermodynamics and control theory. For example, in designing a robot arm or a car steering mechanism one would like to know that one has enough degrees of freedom to reach any point in the configuration space, within the constraints inherent to the system. For more details, see e.g. [44].
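One way to carry out the check that the disk fields are bracket-generating is to compute the iterated brackets at a sample point and test whether they span the tangent space. The sketch below assumes the basis $X_1 = a\cos\theta\,\partial_x + a\sin\theta\,\partial_y + \partial_\phi$, $X_2 = \partial_\theta$ with coordinates ordered $(x, y, \theta, \phi)$; the particular choice of brackets $[X_1, X_2]$ and $[X_2, [X_1, X_2]]$, the radius value, and the helper names are illustrative assumptions.

```python
import math

A_RADIUS = 0.5  # disk radius a (arbitrary illustrative value)

def X1(q):
    x, y, theta, phi = q
    return [A_RADIUS * math.cos(theta), A_RADIUS * math.sin(theta), 0.0, 1.0]

def X2(q):
    return [0.0, 0.0, 1.0, 0.0]

def bracket_field(X, Y, h=1e-4):
    """Return the vector field q -> [X, Y](q), using central differences."""
    def field(q):
        n = len(q)
        x_val, y_val = X(q), Y(q)
        out = [0.0] * n
        for j in range(n):
            plus, minus = list(q), list(q)
            plus[j] += h
            minus[j] -= h
            dX = [(a - b) / (2 * h) for a, b in zip(X(plus), X(minus))]
            dY = [(a - b) / (2 * h) for a, b in zip(Y(plus), Y(minus))]
            for i in range(n):
                out[i] += x_val[j] * dY[i] - y_val[j] * dX[i]
        return out
    return field

def determinant(rows):
    """Determinant by Gaussian elimination with partial pivoting."""
    m = [list(r) for r in rows]
    n = len(m)
    det = 1.0
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            return 0.0
        if pivot != col:
            m[col], m[pivot] = m[pivot], m[col]
            det = -det
        det *= m[col][col]
        for r in range(col + 1, n):
            factor = m[r][col] / m[col][col]
            for c in range(col, n):
                m[r][c] -= factor * m[col][c]
    return det

X3 = bracket_field(X1, X2)   # [X1, X2]
X4 = bracket_field(X2, X3)   # [X2, [X1, X2]]

q0 = [0.2, -0.4, 0.9, 1.7]
frame = [X1(q0), X2(q0), X3(q0), X4(q0)]
det = determinant(frame)
print("det of (X1, X2, [X1,X2], [X2,[X1,X2]]):", det)
```

A nonzero determinant at a point means the four vectors span the four-dimensional tangent space there. Here the magnitude of the determinant is $a^2$ independently of the point, which is consistent with the distribution being bracket-generating everywhere.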

At last we are able to connect the notion of a holonomic constraint to a holo- 
nomic frame field. As should be clear by now, holonomic constraints can be 
integrated to give integral submanifolds of the distribution spanned by the holo- 
nomic vector fields arising from the constraints whereas nonholonomic constraints 
give rise to nonholonomic vector fields that determine nonintegrable distributions. 


8 At first sight this would seem to be impossible. For example, if the disk starts at the origin with $\theta = \phi = 0$, how could we get the disk back to the origin with the same value of $\theta$ and a different value of $\phi$? One way would be to roll the disk around circles on the plane of different radii. Similarly, one might ask how could we get from the origin to some other point $p$ while leaving $\theta$ and $\phi$ fixed. One answer is to change $\theta$ until the disk lies along the line from the origin to $p$, run it to $p$, change $\theta$ back, then run it around a little circle if necessary to get $\phi$ back to its original value. (Rotating in place satisfies the constraint of rolling without slipping.)


Appendix G 

The topology of electrical circuits 


G.1 Kirchhoff's rules

Electrical circuits provide an example of a simple physical system in which homol- 
ogy and cohomology play a small but important part. 1 We consider only the 
simplest case of a resistive network comprising batteries and resistors obeying 
Ohm’s law (Figure G.l), although by employing complex impedances the theory 
can be extended to steady state circuits that also contain capacitors and inductors. 
To solve the circuit (that is, to find the currents through, and voltages across, each 
element) we may appeal to Kirchhoff’s rules: 2 

(1) (junction condition) the sum of the currents into a node is zero; 

(2) (loop law) the sum of the voltage gains around any closed loop is zero. 

The junction condition embodies the law of charge conservation, which says that you cannot create or destroy net charge. The loop law embodies the law of energy conservation (because the work $W$ done on a charge $q$ by a potential difference $\Delta V$ is $W = q\,\Delta V$, so if the law did not hold, a charge could flow around a closed loop and gain energy ad infinitum).


G.2 A graph model 

We can model such a circuit by means of a graph, which is just a bunch of points 

(or nodes or vertices) joined by a bunch of lines (or edges). 3 Topologically, a 

1 For more details, see [8], Chapter 12, [27], Appendix B, [32], or [78],

2 We use the standard convention that a current into a node is positive, while a current out of a node is negative. Similarly, a voltage difference between two points is positive if we gain voltage in going from one to the other, and negative if we lose voltage.

3 The word "line" does not imply a rectilinear edge, although in fact every graph can be drawn in $\mathbb{R}^3$ with straight lines for edges in such a way that edges only meet at their endpoints. One can also view (simple) graphs abstractly as a set of vertices together with a subset of unordered pairs of distinct vertices. For more about graphs, see [35] or [89].

Figure G.1 A simple resistive circuit.

Figure G.2 An oriented graph.

graph is the same thing as a one-dimensional simplicial complex. For simplicity, we will assume that the graph has only one connected component. So, start with a connected graph $G$ with $n$ nodes and $m$ edges that models the topology of the circuit. Label the nodes, then orient each edge arbitrarily. 4 (Figure G.2.) The vector space of $i$-chains on $G$ is denoted $C_i$. The nodes are a basis for $C_0$ and the edges are a basis for $C_1$, so $\dim C_0 = n$ and $\dim C_1 = m$. The boundary operator $\partial : C_1 \to C_0$ is defined by $\partial(p, q) = q - p$, extended by linearity. With respect to the natural basis, $\partial$ is represented by an $n \times m$ matrix (called the vertex-edge incidence matrix). For the graph in Figure G.2,

\[
\partial = \begin{pmatrix}
-1 & 1 & 0 & 0 & 0 & 1 \\
1 & 0 & 1 & -1 & 0 & 0 \\
0 & 0 & 0 & 1 & -1 & -1 \\
0 & -1 & -1 & 0 & 1 & 0
\end{pmatrix}.
\]


4 As with simplices, it should be clear from the context whether one is considering $G$ to be oriented or not, so we do not distinguish between the two cases notationally.

We assign a current $i_j$ to each (oriented) edge $e_j$ and represent the currents altogether by the 1-chain $I = \sum_j i_j e_j$. Kirchhoff's junction condition is then elegantly expressed by the requirement that $I$ be a 1-cycle:

\[ \partial I = 0. \]

For our example this equation reads

\[
\begin{pmatrix}
-1 & 1 & 0 & 0 & 0 & 1 \\
1 & 0 & 1 & -1 & 0 & 0 \\
0 & 0 & 0 & 1 & -1 & -1 \\
0 & -1 & -1 & 0 & 1 & 0
\end{pmatrix}
\begin{pmatrix} i_1 \\ i_2 \\ i_3 \\ i_4 \\ i_5 \\ i_6 \end{pmatrix} = 0. \tag{G.1}
\]

Next, we seek a similar statement about voltages. Here we are guided by the physics. It makes sense to assign an electric potential $\Phi$ to each vertex, so $\Phi$ is naturally a map from the vertices to the real numbers. We extend it by linearity, so that $\Phi$ becomes a 0-cochain. As before, we denote the space of $i$-cochains by $C^i$.

By analogy with the continuous case we denote the natural pairing between chains and cochains by the integral symbol. Thus, if $c = \sum_j b_j e_j$ and $f = \sum_j a_j e^j$, where $\{e^j\}$ is the dual basis to $\{e_j\}$, then

\[ \int_c f := \sum_j a_j b_j. \tag{G.2} \]
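The junction condition and the chain-cochain pairing are easy to experiment with. The following is a minimal sketch using the incidence matrix quoted in the text; the helper names (`boundary`, `edge_endpoints`, `pairing`) and the sample currents and cochain are illustrative assumptions, and the edge orientations are read off from the matrix columns rather than from Figure G.2 itself.

```python
# Vertex-edge incidence matrix (rows: nodes 1..4, columns: edges e1..e6).
BOUNDARY = [
    [-1,  1,  0,  0,  0,  1],
    [ 1,  0,  1, -1,  0,  0],
    [ 0,  0,  0,  1, -1, -1],
    [ 0, -1, -1,  0,  1,  0],
]

def boundary(chain):
    """Apply the boundary operator to a 1-chain given by its edge coefficients."""
    return [sum(row[j] * chain[j] for j in range(len(chain))) for row in BOUNDARY]

def edge_endpoints(j):
    """Return (tail, head) of edge j, using boundary(p, q) = q - p."""
    column = [row[j] for row in BOUNDARY]
    tail = column.index(-1) + 1
    head = column.index(1) + 1
    return (tail, head)

def pairing(cochain, chain):
    """Natural pairing of a 1-cochain with a 1-chain: sum of a_j * b_j."""
    return sum(a * b for a, b in zip(cochain, chain))

edges = [edge_endpoints(j) for j in range(6)]
print("oriented edges:", edges)

loop_current = [1, 0, 0, 1, 0, 1]   # unit current around 1 -> 2 -> 3 -> 1
single_edge = [1, 0, 0, 0, 0, 0]    # current in e1 only
print("boundary of loop current:", boundary(loop_current))   # a 1-cycle
print("boundary of single edge:", boundary(single_edge))     # not a cycle

sample_cochain = [10, 0, 0, 20, 0, 30]
print("pairing with loop:", pairing(sample_cochain, loop_current))
```

A current distribution satisfies Kirchhoff's junction condition exactly when `boundary` returns the zero vector, which happens for the loop current but not for a current confined to a single edge.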


The notation looks even more natural if we recall the coboundary map $\partial^* : C^i \to C^{i+1}$. If $c$ is an $i$-chain and $g$ is an $(i - 1)$-cochain then, in our notation,

\[ \int_c \partial^* g = \int_{\partial c} g, \tag{G.3} \]

a formula that ought to look moderately familiar.

Indeed, if you think of $\partial^*$ as a discrete analogue of the differential operator $d$, and if you recall that the electric field $E$ is the gradient of the electric potential $\Phi$, then you ought to be able to write something like $E = -\partial^*\Phi$. But, in our discrete world, $\partial^*$ is a difference operator rather than a differential operator so instead it makes more sense to write

\[ V = -\partial^*\Phi, \tag{G.4} \]

where $V$ is a 1-cochain representing the voltage differences across each edge. To see that this choice is indeed the right one, we "integrate". If $e = (p, q)$ is an edge then

\[ \int_e V = -\int_e \partial^*\Phi = -\int_{\partial e} \Phi = -[\Phi(q) - \Phi(p)] = \Phi(p) - \Phi(q), \tag{G.5} \]

which is exactly the voltage difference across the edge $e$, consistently with the idea that current runs downhill in the direction of the arrow, from high potential to low potential.

At this point we must confess that we have pulled a fast one, for (G.4) is nothing other than Kirchhoff's loop law in disguise. The reason is that this law reads, in our notation,

\[ \int_z V = 0 \tag{G.6} \]

for every cycle $z$. If (G.4) holds then (G.6) holds, because in that case

\[ \int_z V = -\int_z \partial^*\Phi = -\int_{\partial z} \Phi = 0 \]

as $\partial z = 0$. But the converse holds as well. In essence this is the analogue of the de Rham theorem, which says that a closed form is exact if and only if all its periods vanish. 5

Of course, we would prefer a more direct proof, and in this simple case such a proof is not difficult. First, we introduce a bit of slightly nonstandard terminology. Let $p$ and $q$ be two nodes of $G$. A path from $p$ to $q$ is an alternating sequence

\[ (p_0, \{p_0, p_1\}, p_1, \{p_1, p_2\}, p_2, \dots, \{p_{k-1}, p_k\}, p_k) \]

of vertices and edges, with $p_0 = p$ and $p_k = q$, where successive vertices are distinct. 6 To every such path in $G$ we assign the 1-chain (element of $C_1$) $\sum_{i=0}^{k-1} (p_i, p_{i+1})$ (which by abuse of terminology we also call a path). The boundary of this path is just $q - p$. A cycle of $G$ is just a path that starts and ends at the same point; as an element of $C_1$ it has no boundary, so it is a 1-cycle. 7


5 The 1-cochain $V$ is “closed”, meaning that $\partial^* V = 0$, simply because there are no 2-chains in $G$ and so there
are no 2-cochains either.

6 In particular, although it is not really necessary for anything that follows, we allow our paths to backtrack, to 
cover the same edge many times, etc. We do this simply to agree with the definition of a path in a manifold, 
which need not be an embedded curve. Graph theorists use the word “path” to mean a path with no repeated 
vertices; this forces all the edges to be distinct as well. 

7 Some graph theorists would call our cycle a circuit and would reserve the word “cycle” for a circuit with no 
repeated vertices.

Let $V$ be an arbitrary 1-cochain satisfying (G.6). Fix a point $p$ on $G$, and
let $c$ be a path in $G$ starting at $p$ and ending at some arbitrary point $q$.
Define a 0-cochain $\Phi$ by setting $\Phi(p) := 0$ and

$$\Phi(q) := -\int_c V. \tag{G.7}$$

The claim is that $\Phi$ is well defined, in the sense that it does not depend on
the path chosen. If $c'$ were some other path from $p$ to $q$ then we would have

$$\int_{c'} V - \int_c V = \int_{c' - c} V = 0,$$

because $c' - c$ is a cycle. It follows that $\Phi$ is a well-defined 0-cochain.
Moreover,

$$\int_c V = \Phi(p) - \Phi(q) = -\int_c \partial^*\Phi,$$

so that

$$\int_c (V + \partial^*\Phi) = 0.$$

By linearity it follows that the integral over every edge vanishes, and hence that
$V = -\partial^*\Phi$, as promised.
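The construction (G.7) is effectively an algorithm: walk outward from the base point and accumulate $-V$ along the way. A minimal sketch follows, assuming the same hypothetical four-node graph as before and a sample $V$ that is known to satisfy the loop law; the function name and the breadth-first traversal are illustrative choices, not part of the text.

```python
from collections import deque

# Hypothetical graph and a sample 1-cochain V that satisfies the loop law.
EDGES = [(0, 1), (1, 2), (0, 2), (3, 0), (2, 3), (1, 3)]
V = [4, 5, 9, -2, -7, -2]


def potential_from_voltages(n_nodes, edges, voltages, base=0):
    """Build Phi with Phi(base) = 0 by accumulating -V along paths from base."""
    phi = {base: 0}
    queue = deque([base])
    while queue:
        node = queue.popleft()
        for (p, q), v in zip(edges, voltages):
            if p == node and q not in phi:
                phi[q] = phi[p] - v      # edge traversed along its arrow
                queue.append(q)
            elif q == node and p not in phi:
                phi[p] = phi[q] + v      # edge traversed against its arrow
                queue.append(p)
    return [phi[i] for i in range(n_nodes)]


phi = potential_from_voltages(4, EDGES, V)
# Recompute -(d*phi) on every edge; it should reproduce V on chords too.
recovered = [phi[p] - phi[q] for p, q in EDGES]

print("potential:", phi)
print("recovered voltages:", recovered)
```

The traversal only uses a spanning tree of edges to define $\Phi$; the fact that the remaining edges also come out right is exactly where the loop law (G.6) is used.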

We close this section with an important observation, namely that the
zero-dimensional homology of any path connected space is one-dimensional. In
particular, as any connected graph is path connected we have $\dim H_0(G) = 1$.
Once again, it is worth proving this directly for graphs, as the general case is
similar.

We claim that

$$B_0(G) = \Bigl\{ \sum_i a_i p_i : \sum_i a_i = 0 \Bigr\}.$$

Suppose that on the one hand $b \in B_0$. Then there exists a 1-chain
$c = \sum_{ij} c_{ij} (p_i, p_j)$ with $c_{ij} = -c_{ji}$ and

$$b = \partial c = \sum_{ij} c_{ij} (p_j - p_i)
  = \sum_j \Bigl( \sum_i c_{ij} \Bigr) p_j - \sum_i \Bigl( \sum_j c_{ij} \Bigr) p_i
  = \sum_j \Bigl( \sum_i c_{ij} \Bigr) p_j - \sum_j \Bigl( \sum_i c_{ji} \Bigr) p_j
  = \sum_j \sum_i (c_{ij} - c_{ji})\, p_j
  = \sum_j a_j p_j,$$

where $a_j := \sum_i (c_{ij} - c_{ji})$. Thus

$$\sum_j a_j = \sum_{ij} (c_{ij} - c_{ji}) = 0.$$

On the other hand, suppose that $b = \sum_i a_i p_i$ is a 0-chain with
$\sum_i a_i = 0$. Choose a fiducial point $p \in G$. By connectivity we may
choose a family $\{\gamma_i\}$ of paths, where $\gamma_i$ is a path from $p$ to
$p_i$. Define the 1-chain $c = \sum_i a_i \gamma_i$. Then

$$\partial c = \sum_i a_i (p_i - p) = \sum_i a_i p_i - p \sum_i a_i = \sum_i a_i p_i = b,$$

and the claim is proved. □




Now, every zero chain is a cycle (because there are no lower-dimensional chains)
and so $C_0 = Z_0$. Consider the surjective map $\psi : C_0 \to \mathbb{R}$ given by
$\sum_i a_i p_i \mapsto \sum_i a_i$. This gives an exact sequence,

$$0 \longrightarrow B_0 \xrightarrow{\ \iota\ } Z_0 \xrightarrow{\ \psi\ } \mathbb{R} \longrightarrow 0,$$

where $\iota$ is the inclusion map. We conclude that $\dim(Z_0/B_0) = 1$.

G.3 Tellegen’s theorem 

We observe here that a simple consequence of Kirchhoff’s laws is Tellegen’s
theorem, which is just another expression of the law of energy conservation. If
the voltages $V = \sum_i V_i e^i$ and currents $I = \sum_j I_j e_j$ obey
Kirchhoff’s laws $V = -\partial^*\Phi$ and $\partial I = 0$ then the total power
dissipated by the circuit is

$$\sum_j I_j V_j = \int_I V = -\int_I \partial^*\Phi = -\int_{\partial I} \Phi = 0, \tag{G.8}$$

so we obtain the result that an isolated circuit conserves energy. Of course we
ought to have expected this, considering that we derived Kirchhoff’s laws assuming
energy conservation, but it is comforting nonetheless.
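A quick numerical illustration of (G.8): any current that is a combination of closed loops pairs to zero with any voltage that comes from a potential. The graph, the loop coefficients, and the potentials below are all assumed values picked for the example, and the helper `boundary` is a hypothetical name.

```python
# Hypothetical 4-node, 6-edge graph; the edge (p, q) points from p to q.
EDGES = [(0, 1), (1, 2), (0, 2), (3, 0), (2, 3), (1, 3)]
CYCLES = [
    [1, 1, -1, 0, 0, 0],   # e1 + e2 - e3
    [0, 0, 1, 1, 1, 0],    # e3 + e4 + e5
    [1, 0, 0, 1, 0, 1],    # e1 + e4 + e6
]


def boundary(chain, edges, n_nodes):
    """Boundary of a 1-chain: each edge (p, q) contributes q - p."""
    out = [0] * n_nodes
    for coeff, (p, q) in zip(chain, edges):
        out[q] += coeff
        out[p] -= coeff
    return out


phi = [7, 3, -2, 5]                              # arbitrary potentials
V = [phi[p] - phi[q] for p, q in EDGES]          # V = -d*phi
J = [2, -3, 5]                                   # arbitrary loop currents
I = [sum(J[k] * CYCLES[k][e] for k in range(3)) for e in range(6)]

junction = boundary(I, EDGES, 4)                 # Kirchhoff's junction law
power = sum(i * v for i, v in zip(I, V))         # left side of (G.8)

print("boundary of I:", junction)
print("total power:", power)
```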

G.4 Ohm’s law 

We are finally in a position to write down the “equations of motion” for our system,
which in our case is just Ohm’s law $V = IR$. But we have defined $V$ as a
1-cochain and $I$ as a 1-chain, so $R$ cannot just be a real number. Once again we
draw inspiration from physics. In a bulk medium Ohm’s law is $j = \sigma E$, where
$j$ is the current density vector, $\sigma$ is the conductivity tensor, and $E$ is
the electric field vector. Equivalently, $E = \rho j$, where $\rho = \sigma^{-1}$
is the resistivity tensor. By analogy, then, we write

$$V = ZI + v, \tag{G.9}$$

where $V$ is the vector of voltage differences, $Z$ is the impedance matrix, $I$ is
the vector of currents, and $v$ is a vector representing the voltages contributed
by any batteries on the edges. We may think of the impedance matrix as the matrix
representation of a linear transformation $Z : C_1 \to C^1$. In our case the
impedance matrix is diagonal, the diagonal elements being the resistances across
each edge, but, as mentioned above, it could just as well contain complex
impedances to account for capacitors and inductors.

We must now somehow combine Kirchhoff’s rules and Ohm’s law to solve the
circuit. It is not clear at this stage that it can actually be done. But it can, and this is
the content of Kirchhoff’s theorem. Actually, there are many ways to solve for the
currents and voltages, although we will discuss just one.



By way of motivation, we may recall that, when we were using Kirchhoff’s laws
to solve for the currents in a circuit, we only needed to use a few loops rather
than all of them. The key point is that most of the information in (G.1) and (G.6)
is redundant. The general rule of thumb, taught in elementary physics classes, 
is to choose just enough loops that every branch of the circuit lies in some loop 
and then add in as many independent junction conditions as are needed to get an 
equal number of equations and unknowns. Formalizing these intuitions leads to 
some really fun graph theory, which we can only discuss in the briefest possible 
fashion. 


G.5 Spanning trees and cycle bases 

If a graph contains no cycles then it is called a forest, and if it only has one
connected component then it is a tree. (See Figure G.3.) A bridge is an edge whose
removal disconnects $G$. A spanning tree of a connected graph $G$ is a subgraph of
$G$ that contains all the vertices of $G$ and some edges of $G$ and is a tree. (See
Figure G.4.)

EXERCISE G.1 Show that a connected graph is a tree if and only if every edge is a
bridge.

Theorem G.1 A connected graph $G$ with $n$ vertices is a tree if and only if it has
$n - 1$ edges.

Figure G.3 A forest of trees. 






Figure G.4 A graph (upper left) and three of its spanning trees.

Proof The assertion is immediate for a single edge with $n = 2$. So, assume that
$G$ is connected, with $n$ vertices. If $G$ is a tree then removing an edge gives two
disconnected pieces. Let $n_i$, $i = 1, 2$, be the number of vertices in each piece, so
that $n = n_1 + n_2$. Evidently $n_i < n$, $i = 1, 2$, so by induction on the number of
vertices each piece has $n_i - 1$ edges. Accounting for the edge that was removed, the
number of edges in the original graph is therefore $(n_1 - 1) + (n_2 - 1) + 1 = n - 1$.

Conversely, suppose $G$ has $n$ vertices but is not a tree. By successively removing
edges that are not bridges, we must eventually arrive at a spanning tree, which, as
we have shown, must have $n - 1$ edges. Clearly, $G$ must have had more than $n - 1$
edges to begin with. □

Remark There is a really wonderful theorem, called Kirchhoff’s matrix tree 
theorem, which gives a simple formula for the tree number, namely the number 
of spanning trees in a connected graph. The Laplacian matrix corresponding to 
the graph $G$ is defined to be the $n \times n$ matrix $L = \partial\partial^*$,
where we identify $C_i$ with $C^i$, $i = 0, 1$, so that $\partial^*$ is just the
transpose of $\partial$. (This is just a discrete version of the Laplace-de Rham
operator (8.96).) It is not difficult to show that $L$ is independent of the choice
of orientation of the edges. More remarkable is the fact that all the cofactors of
$L$ are equal, and they are all equal to the tree number! (For the proofs see [9] or
[89].) Try it for the graph in Figure G.4. Answer: 15. (The matrix tree theorem can
be generalized to higher-dimensional simplicial complexes. See, e.g., [24].)
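The cofactor statement is easy to test by machine. The sketch below does not use the graph of Figure G.4 (which is not reproduced here); instead it assumes the complete graph on four vertices with an arbitrary orientation, for which Cayley's formula gives $4^{2} = 16$ spanning trees. The helper names are hypothetical, and exact rational arithmetic is used so that the cofactors can be compared exactly.

```python
from fractions import Fraction

# Assumed example: the complete graph K4 with an arbitrary edge orientation.
EDGES = [(0, 1), (1, 2), (0, 2), (3, 0), (2, 3), (1, 3)]
N = 4


def boundary_matrix(n_nodes, edges):
    """Matrix of the boundary map: column e has -1 at the tail, +1 at the head."""
    d = [[0] * len(edges) for _ in range(n_nodes)]
    for e, (p, q) in enumerate(edges):
        d[p][e] -= 1
        d[q][e] += 1
    return d


def det(matrix):
    """Determinant by Gaussian elimination over the rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    n, sign, result = len(a), 1, Fraction(1)
    for col in range(n):
        pivot = next((r for r in range(col, n) if a[r][col] != 0), None)
        if pivot is None:
            return Fraction(0)
        if pivot != col:
            a[col], a[pivot] = a[pivot], a[col]
            sign = -sign
        result *= a[col][col]
        for r in range(col + 1, n):
            factor = a[r][col] / a[col][col]
            for c in range(col, n):
                a[r][c] -= factor * a[col][c]
    return sign * result


def cofactor(matrix, i, j):
    """Signed determinant of the matrix with row i and column j deleted."""
    minor = [[x for c, x in enumerate(row) if c != j]
             for r, row in enumerate(matrix) if r != i]
    return (-1) ** (i + j) * det(minor)


D = boundary_matrix(N, EDGES)
# Laplacian L = D D^T (the coboundary is the transpose of the boundary).
L = [[sum(D[i][e] * D[j][e] for e in range(len(EDGES))) for j in range(N)]
     for i in range(N)]
cofactors = {cofactor(L, i, j) for i in range(N) for j in range(N)}

print("Laplacian:", L)
print("distinct cofactor values:", cofactors)
```

Reversing any edge in `EDGES` changes `D` but not `L`, which is the orientation independence mentioned in the remark.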

If $G$ is a connected graph and $T$ is a spanning tree not equal to $G$, an edge in
$G$ but not in $T$ is called a chord. If we adjoin a chord to a spanning tree then
we get a cycle of $G$, called an elementary cycle. All the cycles obtained in this
fashion are linearly independent as elements of $C_1$, and therefore of $Z_1$,
because each contains an edge not contained in the others, namely, its chord. (See
Figure G.5.)

Figure G.5 (a) A graph, (b) a spanning tree, and (c), (d), and (e) its elementary
cycles. The broken lines are the chords associated to the tree.

The claim is that the elementary cycles form a basis for $Z_1$. First, we count the
number of chords of a spanning tree. But that’s easy. The graph has $n$ vertices and
$m$ edges, while the tree has $n$ vertices and $n - 1$ edges, so there must be
$m - n + 1$ chords.

Next, consider the chain complex

$$C_1 \xrightarrow{\ \partial_1\ } C_0 \xrightarrow{\ \partial_0\ } 0.$$

The cycle rank of a graph $G$, $\operatorname{rk} G$, is the dimension of the cycle
space $Z_1 = \ker \partial_1$. By the rank-nullity theorem,
$\dim C_1 = \operatorname{rk} \partial_1 + \dim \ker \partial_1$, which gives
$m = \dim B_0 + \operatorname{rk} G$, where $m$ is the number of edges of $G$ and
$B_0$ is the space of 0-boundaries. But the graph is connected, so
$1 = \dim H_0 = \dim Z_0 - \dim B_0$. As everything in $C_0$ is killed by
$\partial_0$, it follows that $Z_0 = C_0$ so we get $1 = n - \dim B_0$. Combining
these formulae gives

$$\operatorname{rk} G = m - n + 1. \tag{G.10}$$

This equals the number of chords of $G$ (relative to any spanning tree), so the
elementary cycles really do form a basis for $Z_1$.
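The counting argument behind (G.10) can be mirrored numerically: compute the rank of the boundary matrix and subtract it from the number of edges. The sketch assumes a hypothetical connected graph with four vertices and six edges; `rank` is a plain Gauss-Jordan routine written for the example.

```python
from fractions import Fraction

# Hypothetical connected graph with n = 4 vertices and m = 6 edges.
EDGES = [(0, 1), (1, 2), (0, 2), (3, 0), (2, 3), (1, 3)]
N = 4


def boundary_matrix(n_nodes, edges):
    """Matrix of the boundary map from 1-chains to 0-chains."""
    d = [[0] * len(edges) for _ in range(n_nodes)]
    for e, (p, q) in enumerate(edges):
        d[p][e] -= 1
        d[q][e] += 1
    return d


def rank(matrix):
    """Rank by Gauss-Jordan elimination over the rationals."""
    a = [[Fraction(x) for x in row] for row in matrix]
    rows, cols, r = len(a), len(a[0]), 0
    for c in range(cols):
        pivot = next((i for i in range(r, rows) if a[i][c] != 0), None)
        if pivot is None:
            continue
        a[r], a[pivot] = a[pivot], a[r]
        for i in range(rows):
            if i != r and a[i][c] != 0:
                f = a[i][c] / a[r][c]
                a[i] = [x - f * y for x, y in zip(a[i], a[r])]
        r += 1
        if r == rows:
            break
    return r


m = len(EDGES)
dim_B0 = rank(boundary_matrix(N, EDGES))   # dimension of the 0-boundaries
cycle_rank = m - dim_B0                    # dim ker(boundary) by rank-nullity

print("dim B_0 =", dim_B0)
print("cycle rank =", cycle_rank, "and m - n + 1 =", m - N + 1)
```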


G.6 The mesh current method 

At this stage it should come as no surprise that the number of independent loops 
that we must use when applying Kirchhoff’s laws is precisely the cycle rank of G. 
First, pick a spanning tree of G and then write down the corresponding elementary 
cycles. For example, for the circuit modeled by Figure G.2 we obtain the cycles 
shown in Figure G.5. In the order (c), (d), and (e) they are 


$$z_1 = e_1 + e_2 - e_3, \qquad z_2 = e_3 + e_4 + e_5, \qquad z_3 = e_1 + e_4 + e_6.$$

Next, one imagines there are currents $J_i$ flowing around the elementary cycles
$z_i$, called mesh currents. These are packaged into a 1-cycle
$J = \sum_i J_i z_i$. The inclusion map $\rho : Z_1 \to C_1$ relates the mesh
currents to the edge currents. For our circuit, the matrix representation of
$\rho$ is

$$\rho = \begin{pmatrix} 1 & 0 & 1 \\ 1 & 0 & 0 \\ -1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix},
\qquad \text{giving} \qquad
I = \rho J = \begin{pmatrix} J_1 + J_3 \\ J_1 \\ -J_1 + J_2 \\ J_2 + J_3 \\ J_2 \\ J_3 \end{pmatrix}.$$

Observe that Kirchhoff’s junction condition is automatically satisfied for this
vector of currents, because $\rho$ takes cycles to cycles, so $\partial\rho J = 0$.
The reader may wish to check this for our circuit, using matrix multiplication to
verify that $\partial\rho = 0$.
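The suggested check takes only a few lines. Since Figure G.2 is not reproduced here, the edge list below is an assumption: it is one orientation of a four-node graph whose elementary cycles are exactly the three listed above, and it should be replaced by the actual circuit graph if that differs.

```python
# Assumed orientation of the circuit graph (Figure G.2 is not shown here);
# the edge (p, q) points from node p to node q, in the order e1, ..., e6.
EDGES = [(0, 1), (1, 2), (0, 2), (3, 0), (2, 3), (1, 3)]

# Matrix of the inclusion map rho; its columns are z1, z2, z3.
RHO = [
    [1, 0, 1],
    [1, 0, 0],
    [-1, 1, 0],
    [0, 1, 1],
    [0, 1, 0],
    [0, 0, 1],
]


def boundary_matrix(n_nodes, edges):
    """Matrix of the boundary map: column e has -1 at the tail, +1 at the head."""
    d = [[0] * len(edges) for _ in range(n_nodes)]
    for e, (p, q) in enumerate(edges):
        d[p][e] -= 1
        d[q][e] += 1
    return d


def matmul(a, b):
    """Plain matrix product of two lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))] for i in range(len(a))]


product = matmul(boundary_matrix(4, EDGES), RHO)
print(product)   # a 4 x 3 block of zeros means every column of rho is a cycle
```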

As there are no 2-chains in $G$, no 1-cycle can be a boundary of something. This
means that $Z_1$ is the same thing as $H_1$, the first homology group. In
particular the inclusion map may be viewed as a map $\rho : H_1 \to C_1$, and it
induces a dual map $\pi : C^1 \to H^1$. In terms of the canonical dual bases the
matrix representation of $\pi$ is just the transpose of the matrix representation
of $\rho$, so, in our example,

$$\pi = \begin{pmatrix} 1 & 1 & -1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 & 0 & 1 \end{pmatrix}.$$

The dual of $\partial\rho$ is $\pi\partial^*$ and, as the former vanishes, so must
the latter.

At last we are in a position to write down an algorithm for solving circuits.
Write the currents $I$ in terms of the unknown mesh currents $J$ according to
$I = \rho J$, so that Kirchhoff’s junction condition is satisfied, write
$V = -\partial^*\Phi$ so that Kirchhoff’s loop law is satisfied, and substitute
into Ohm’s law (G.9) to get

$$Z\rho J = -\partial^*\Phi - v.$$

Apply $\pi$ to both sides and use $\pi\partial^* = 0$ to obtain

$$\pi Z \rho J = -\pi v. \tag{G.11}$$

If $\pi Z \rho$ is invertible⁸ then we get

$$J = -(\pi Z \rho)^{-1} \pi v. \tag{G.12}$$

Once one has $J$, one can obtain $I$ from $I = \rho J$ and $V$ from $V = ZI + v$.
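What does all this look like for the circuit in Figure G.1? We compute

$$\pi Z \rho =
\begin{pmatrix} 1 & 1 & -1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 & 0 & 1 \end{pmatrix}
\begin{pmatrix} R_1 & 0 & 0 & 0 & 0 & 0 \\ 0 & R_2 & 0 & 0 & 0 & 0 \\ 0 & 0 & R_3 & 0 & 0 & 0 \\ 0 & 0 & 0 & R_4 & 0 & 0 \\ 0 & 0 & 0 & 0 & R_5 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{pmatrix}
\begin{pmatrix} 1 & 0 & 1 \\ 1 & 0 & 0 \\ -1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
= \begin{pmatrix} R_1 + R_2 + R_3 & -R_3 & R_1 \\ -R_3 & R_3 + R_4 + R_5 & R_4 \\ R_1 & R_4 & R_1 + R_4 \end{pmatrix}.$$

For concreteness, let’s take $R_1 = R_3 = R_5 = 5\ \Omega$ and
$R_2 = R_4 = 10\ \Omega$. Then

$$\pi Z \rho = \begin{pmatrix} 20 & -5 & 5 \\ -5 & 20 & 10 \\ 5 & 10 & 15 \end{pmatrix}
\qquad \text{and} \qquad
(\pi Z \rho)^{-1} = \begin{pmatrix} 8/105 & 1/21 & -2/35 \\ 1/21 & 11/105 & -3/35 \\ -2/35 & -3/35 & 1/7 \end{pmatrix}.$$

Also, setting $V_0 = 10$ volts, say, we get⁹

$$\pi v = \begin{pmatrix} 1 & 1 & -1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 & 1 & 0 \\ 1 & 0 & 0 & 1 & 0 & 1 \end{pmatrix}
\begin{pmatrix} 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ -10 \end{pmatrix}
= \begin{pmatrix} 0 \\ 0 \\ -10 \end{pmatrix}$$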

8 There are some subtleties when $Z$ contains complex impedances, but one can show that $\pi Z \rho$ is always
invertible for purely resistive circuits. See [8].

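9 We must set $v_6 = -10$ volts because, by our convention, voltage differences are positive if we go from high
potential to low potential when following the arrows.

and therefore

$$J = -(\pi Z \rho)^{-1} \pi v
= -\begin{pmatrix} 8/105 & 1/21 & -2/35 \\ 1/21 & 11/105 & -3/35 \\ -2/35 & -3/35 & 1/7 \end{pmatrix}
\begin{pmatrix} 0 \\ 0 \\ -10 \end{pmatrix}
= \frac{1}{7} \begin{pmatrix} -4 \\ -6 \\ 10 \end{pmatrix},$$

$$I = \rho J
= \frac{1}{7} \begin{pmatrix} 1 & 0 & 1 \\ 1 & 0 & 0 \\ -1 & 1 & 0 \\ 0 & 1 & 1 \\ 0 & 1 & 0 \\ 0 & 0 & 1 \end{pmatrix}
\begin{pmatrix} -4 \\ -6 \\ 10 \end{pmatrix}
= \frac{1}{7} \begin{pmatrix} 6 \\ -4 \\ -2 \\ 4 \\ -6 \\ 10 \end{pmatrix},$$

and

$$V = ZI + v
= \frac{1}{7} \begin{pmatrix} 5 & 0 & 0 & 0 & 0 & 0 \\ 0 & 10 & 0 & 0 & 0 & 0 \\ 0 & 0 & 5 & 0 & 0 & 0 \\ 0 & 0 & 0 & 10 & 0 & 0 \\ 0 & 0 & 0 & 0 & 5 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 \end{pmatrix}
\begin{pmatrix} 6 \\ -4 \\ -2 \\ 4 \\ -6 \\ 10 \end{pmatrix}
+ \begin{pmatrix} 0 \\ 0 \\ 0 \\ 0 \\ 0 \\ -10 \end{pmatrix}
= \frac{1}{7} \begin{pmatrix} 30 \\ -40 \\ -10 \\ 40 \\ -30 \\ -70 \end{pmatrix},$$

where currents are in amperes and voltages are in volts.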
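The whole mesh current computation can be reproduced in exact arithmetic. The sketch below takes $\rho$, the resistances, and the battery vector $v$ from the worked example; the helper functions (`transpose`, `matvec`, `solve`) are hypothetical names written for the illustration, and solving the linear system directly replaces the explicit matrix inverse used in the text.

```python
from fractions import Fraction

RHO = [[1, 0, 1], [1, 0, 0], [-1, 1, 0], [0, 1, 1], [0, 1, 0], [0, 0, 1]]
R = [5, 10, 5, 10, 5, 0]          # diagonal of Z in ohms (battery edge has R = 0)
v = [0, 0, 0, 0, 0, -10]          # battery voltages on the edges


def transpose(a):
    return [list(col) for col in zip(*a)]


def matvec(a, x):
    return [sum(aij * xj for aij, xj in zip(row, x)) for row in a]


def solve(a, b):
    """Solve a x = b by Gauss-Jordan elimination over the rationals."""
    n = len(a)
    aug = [[Fraction(x) for x in row] + [Fraction(y)] for row, y in zip(a, b)]
    for col in range(n):
        pivot = next(r for r in range(col, n) if aug[r][col] != 0)
        aug[col], aug[pivot] = aug[pivot], aug[col]
        lead = aug[col][col]
        aug[col] = [x / lead for x in aug[col]]
        for r in range(n):
            if r != col:
                f = aug[r][col]
                aug[r] = [x - f * y for x, y in zip(aug[r], aug[col])]
    return [row[-1] for row in aug]


PI = transpose(RHO)
# The 3 x 3 matrix pi Z rho of equation (G.11).
A = [[sum(PI[i][k] * R[k] * RHO[k][j] for k in range(6)) for j in range(3)]
     for i in range(3)]
J = solve(A, [-x for x in matvec(PI, v)])      # mesh currents, (G.12)
I = matvec(RHO, J)                             # edge currents
V = [R[k] * I[k] + v[k] for k in range(6)]     # Ohm's law (G.9)

print("pi Z rho =", A)
print("J =", [str(x) for x in J])
print("I =", [str(x) for x in I])
print("V =", [str(x) for x in V])
print("loop law check, pi V =", matvec(PI, V))
```

The final line confirms that the computed voltages satisfy Kirchhoff's loop law around each elementary cycle.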


Appendix H 

Intrinsic and extrinsic curvature 


In Section 8.5 we noted that the cylinder is a flat manifold, in the sense that 
its curvature vanishes. But this conflicts with our intuition, which tells us that 
a cylinder is obviously curved. These two apparently contradictory facts are 
reconciled by distinguishing two types of curvature: intrinsic and extrinsic. All 
the curvatures we have talked about elsewhere in this book are intrinsic - they 
depend only on the manifold itself. In contrast, extrinsic curvature has to do with 
how a manifold is embedded in a higher-dimensional space. The cylinder has 
zero intrinsic curvature but nonzero extrinsic curvature when viewed as a
surface in $\mathbb{R}^3$. In this appendix we investigate how these two types of
curvature are related.

Let $M$ and $N$ be smooth geometric manifolds of dimensions $m$ and $n$,
respectively, with $m < n$, and let $f : M \to N$ be an immersion (so that $M$ is a
submanifold of $N$). Let $g$ denote the metric on $N$ and $h = f^*g$ the induced
metric on $M$. At every point $p$ of $M$ the tangent space to $N$ naturally breaks
up into a direct sum $T_pN = T_pM \oplus T_p^{\perp}M$, where $T_p^{\perp}M$ is the
orthogonal complement to $T_pM$ (relative to the metric $g$); $T_p^{\perp}M$ is
called the normal space to $M$ at $p$ and its elements are normal vectors. The
union of all the normal spaces to $M$ is $T^{\perp}M$, the normal bundle of $M$. A
normal vector field is a smooth section of the normal bundle, namely, a smooth
vector field of $N$ that is everywhere normal to $M$.

Let $X$ and $Y$ be tangent vector fields on $M$. Then, by definition,
$h(X, Y)_p = g(f_*X, f_*Y)_{f(p)}$. But the immersion $f$ restricts to the identity
on $M$, so we can identify $p$ and $f(p)$, in which case $f_* : T_pM \to T_pM$ is
just the identity map. It follows that $h(X, Y)_p = g(X, Y)_p$. If either $X$ or
$Y$ had a normal component it would make no sense to write $h(X, Y)$. Therefore we
may safely use $g$ to denote the metric on both $M$ and $N$, which is the standard
convention anyway.

Now let $\nabla$ and $\tilde\nabla$ denote the Levi-Civita connections of $M$ and
$N$, respectively. The Gauss formula relates these two connections.

Theorem H.1 Let $X$ and $Y$ be vector fields tangent to $M$. Then

$$\tilde\nabla_X Y = \nabla_X Y + \alpha(X, Y), \tag{H.1}$$

where $\alpha(X, Y) \in T^{\perp}M$.

Proof The covariant derivative operator $\tilde\nabla$ maps vector fields on $N$ to
vector fields on $N$ so, technically, to define $\tilde\nabla_X Y$ we must first
extend $X$ and $Y$ to vector fields $\tilde X$ and $\tilde Y$ on $N$. But the whole
point of the covariant derivative is that it depends only on what vector fields do
at a point and so is independent of any local extension. In other words,

$$\tilde\nabla_{\tilde X} \tilde Y = \tilde\nabla_X Y. \tag{H.2}$$

Note also for future use that, on $M$, $[\tilde X, \tilde Y] = [X, Y]$ because the
Lie bracket preserves tangency.

Now, $\tilde\nabla_X Y$ lies in the tangent space to $N$ so we can break it up into
two pieces, one tangent to $M$ and the other normal to $M$. Let
$P : T_pN \to T_pM$ be the linear projection onto $T_pM$, and define
$\beta(X, Y)_p := P(\tilde\nabla_X Y)_p$ and
$\alpha(X, Y)_p := (1 - P)(\tilde\nabla_X Y)_p$. We must show that
$\beta(X, Y) = \nabla_X Y$, where $\nabla$ is the Levi-Civita connection on $M$
obtained from the induced metric. To do this, we must show that axioms (C1)-(C3),
(C7), and (C8) of Sections 7.2 and 8.2 are satisfied.

Axioms (C1)-(C3) follow simply from the corresponding properties of $\tilde\nabla$
because $P$ is linear. For example, axiom (C2) applied to $\tilde\nabla$ gives, for
any vector fields $X$, $Y$ and function $f$ on $M$,

$$\beta(X, fY) = P(\tilde\nabla_X fY) = P(f \tilde\nabla_X Y) + P((Xf)Y) = f\beta(X, Y) + (Xf)Y.$$

(The term $(Xf)Y$ is already tangent to $M$.) To show axiom (C7) we must show
that, for any vector fields $X$, $Y$ on $M$,

$$\beta(X, Y) - \beta(Y, X) = [X, Y]. \tag{H.3}$$

But $\tilde\nabla$ is torsion free, so

$$0 = \tilde\nabla_{\tilde X} \tilde Y - \tilde\nabla_{\tilde Y} \tilde X - [\tilde X, \tilde Y]
   = \tilde\nabla_X Y - \tilde\nabla_Y X - [X, Y]
   = (\beta(X, Y) + \alpha(X, Y)) - (\beta(Y, X) + \alpha(Y, X)) - [X, Y]. \tag{H.4}$$

The tangential component of (H.4) is just (H.3), while the normal component of
(H.4) yields

$$\alpha(X, Y) = \alpha(Y, X). \tag{H.5}$$

Again, let $X$, $Y$, and $Z$ be vector fields on $M$, with corresponding
(arbitrary) extensions $\tilde X$, $\tilde Y$, and $\tilde Z$. By the metric
compatibility of $\tilde\nabla$,

$$\tilde X g(\tilde Y, \tilde Z) = g(\tilde\nabla_{\tilde X} \tilde Y, \tilde Z) + g(\tilde Y, \tilde\nabla_{\tilde X} \tilde Z).$$

Using (H.2) and the fact that, by construction,
$g(\tilde X, \tilde Y)_p = g(X, Y)_p$ for all $p \in M$, we get

$$X g(Y, Z) = g(\tilde\nabla_X Y, Z) + g(Y, \tilde\nabla_X Z)
            = g(\beta(X, Y) + \alpha(X, Y), Z) + g(Y, \beta(X, Z) + \alpha(X, Z))
            = g(\beta(X, Y), Z) + g(Y, \beta(X, Z)),$$

because the normal space of $M$ is orthogonal to the tangent space of $M$. Hence
axiom (C8) holds for $\beta(X, Y)$. By the uniqueness of the Levi-Civita
connection, we conclude that $\beta(X, Y) = \nabla_X Y$, as promised. □

EXERCISE H.1 Show that $\alpha$ is function linear.

By virtue of equation (H.5) and Exercise H.1, $\alpha$ defines a symmetric tensor
field on $M$ with values in the normal space of $M$. It is called the second
fundamental tensor of $M$ and depends on the chosen embedding of $M$ into $N$. If
the codimension of $M$ (namely, $n - m$) is unity, $M$ is called a hypersurface. If
$\xi$ is a field of unit normals to the hypersurface then

$$\alpha(X, Y) = \mathrm{II}_\xi(X, Y)\, \xi, \tag{H.6}$$

where $\mathrm{II}_\xi(X, Y)$ is sometimes called the extrinsic curvature tensor or
the second fundamental form of the hypersurface $M$.¹

We can use the Gauss formula (H.1) together with the definition of the curvature
tensor (7.36) to relate the intrinsic and extrinsic Riemann curvatures of $M$ and
$N$. Let $R$ (respectively, $\tilde R$) denote the Riemann curvature tensor obtained
from $\nabla$ (respectively, $\tilde\nabla$). If $X$, $Y$, and $Z$ are tangent
vector fields on $M$ then, by considering tangential and normal projections, we
obtain the equations of Gauss and Codazzi-Mainardi:

$$P(\tilde R(X, Y)Z) = R(X, Y)Z + P(\tilde\nabla_X \alpha(Y, Z) - \tilde\nabla_Y \alpha(X, Z)) \tag{H.7}$$

and

$$(1 - P)(\tilde R(X, Y)Z) = \alpha(X, \nabla_Y Z) - \alpha(Y, \nabla_X Z) - \alpha([X, Y], Z)
  + (1 - P)(\tilde\nabla_X \alpha(Y, Z) - \tilde\nabla_Y \alpha(X, Z)). \tag{H.8}$$

EXERCISE H.2 Derive (H.7) and (H.8).

1 The first fundamental form of $M$ is just the metric (viewed as a quadratic form).

These equations assume their more traditional forms when $M$ is a hypersurface
of $N$ with unit normal vector field $\xi$. The Weingarten map or shape operator
$S_\xi : T_pM \to T_pM$ is given by

$$S_\xi(X) = -P(\tilde\nabla_X \xi), \tag{H.9}$$

where $X \in \Gamma(TM)$ is a vector field tangent to $M$.² Actually, the
projection operator $P$ is redundant here because $\xi$ is a unit vector field so
$\tilde\nabla_X \xi$ is already tangent to $M$, as the following computation
demonstrates:

$$1 = g(\xi, \xi) \quad \Rightarrow \quad 0 = X g(\xi, \xi) = 2 g(\tilde\nabla_X \xi, \xi).$$

EXERCISE H.3 Prove Weingarten’s formula

$$g(S_\xi(X), Y) = g(\alpha(X, Y), \xi) = \mathrm{II}_\xi(X, Y). \tag{H.10}$$

Hint: Use the metric compatibility of $\tilde\nabla$ and the orthogonality of the
tangent and normal spaces.


Remark The Weingarten map has a useful geometrical interpretation when $M$ is
a hypersurface in Euclidean space $N = \mathbb{R}^n$. Let $g$ be the ordinary
Euclidean metric (so that $\tilde\nabla$ is just the ordinary gradient), and let
$\xi = \xi^j \partial_j$ be the hypersurface unit normal. Then
$g(\xi, \xi) = \sum_j (\xi^j)^2 = 1$, so we get a natural map
$\Xi : M \to S^{n-1}$, given by $p \mapsto (\xi^1(p), \ldots, \xi^n(p))$, called
the Gauss map or sphere map. We claim that, under these circumstances, the
Weingarten map can be viewed as the negative of $\Xi_*$, the differential of the
Gauss map.

To see this, first note that $\Xi_*$ maps $T_pM$ to $T_{\Xi(p)}S^{n-1}$. But
$T_{\Xi(p)}S^{n-1}$ and $T_pM$ are parallel affine hyperplanes (because each
consists of the set of all vectors orthogonal to $\xi(p)$), so we may identify them
(up to translation). Let $X = X^k \partial_k$ be a vector field on $M$ and $f$ a
smooth function on $\mathbb{R}^n$. Then

$$(\Xi_* X)_{\Xi(p)}(f) = X_p(\Xi^* f) = X_p(f \circ \Xi) = X_p\bigl(f(\xi^1, \ldots, \xi^n)\bigr)
  = X^k(p)\, \frac{\partial \xi^j}{\partial x^k}\bigg|_p\, \frac{\partial f}{\partial x^j}\bigg|_{\Xi(p)}
  = (\tilde\nabla_X \xi)^j_p\, \frac{\partial f}{\partial x^j}\bigg|_{\Xi(p)}.$$

Stripping $f$ from both sides and using the identification of tangent spaces allows
us to write

$$\Xi_*(X) = \tilde\nabla_X \xi. \tag{H.11}$$

2 Technically, we should define $S_\xi(X) = -P(\tilde\nabla_{\tilde X} \tilde\xi)$ where $\tilde X$ and $\tilde\xi$ are local extensions of $X$ and $\xi$ to $N$ but, as
before, the properties of covariant derivatives ensure that $S$ is independent of the manner in which the vector
fields are extended. The minus sign is inserted to make Weingarten’s formula (H.10) look nice.

Looking at (H.10) we see that, because the right-hand side is symmetric in $X$ and
$Y$, so is the left-hand side. That is, the shape operator is a self-adjoint linear
operator so it can be diagonalized by an orthogonal transformation. Let
$\{e_1, \ldots, e_m\}$ be a basis of $T_pM$ consisting of orthonormal eigenvectors
of $S_\xi$, with corresponding eigenvalues $\{\lambda_1, \ldots, \lambda_m\}$. The
eigenvectors $e_i$ are called the principal directions (of the embedding) and the
eigenvalues $\lambda_i$ are called the principal curvatures. The product of the
principal curvatures (i.e., the determinant of the shape operator) is called the
Gauss-Kronecker curvature of $M$, and the average of the principal curvatures
(i.e., the trace of the shape operator divided by $m$) is called the mean curvature
of $M$. Gauss’s theorema egregium is the statement that, in the case of a
two-dimensional surface embedded in $\mathbb{R}^3$, the Gauss-Kronecker curvature
(the product of the principal curvatures) is precisely the Gaussian curvature (the
sectional curvature) and is therefore a purely intrinsic quantity, independent of
the particular embedding. (See Exercise H.5 below.)

EXERCISE H.4 Let $W$, $X$, $Y$, and $Z$ be tangent vector fields on $M$, a
hypersurface in $N$ with unit normal field $\xi$. Show that the Gauss and
Codazzi-Mainardi equations become

$$g(\tilde R(X, Y)Z, W) = g(R(X, Y)Z, W)
  + \mathrm{II}_\xi(X, Z)\, \mathrm{II}_\xi(Y, W) - \mathrm{II}_\xi(Y, Z)\, \mathrm{II}_\xi(X, W) \tag{H.12}$$

and

$$g(\tilde R(X, Y)Z, \xi) = X\, \mathrm{II}_\xi(Y, Z) - Y\, \mathrm{II}_\xi(X, Z)
  + \mathrm{II}_\xi(X, \nabla_Y Z) - \mathrm{II}_\xi(Y, \nabla_X Z) - \mathrm{II}_\xi([X, Y], Z) \tag{H.13}$$

$$\phantom{g(\tilde R(X, Y)Z, \xi)} = g(\nabla_X S_\xi(Y) - \nabla_Y S_\xi(X) - S_\xi([X, Y]), Z), \tag{H.14}$$

respectively.

EXERCISE H.5 Prove Gauss’s theorema egregium when $M$ is a surface in
$\mathbb{R}^3$, by showing that the determinant of the shape operator is the
Gaussian curvature.

Hint: Choose $X$, $Y$, $Z$, and $W$ appropriately in (H.12).

EXERCISE H.6 Assume that $M$ is a hypersurface in Euclidean space. Choose local
coordinates $x^i$ on $M$ such that the components of the metric tensor and second
fundamental form of $M$ are

$$g_{ij} = g(\partial_i, \partial_j) \qquad \text{and} \qquad b_{ij} = \mathrm{II}(\partial_i, \partial_j),$$

respectively. (As usual, $\partial_i := \partial/\partial x^i$.) Show that the
Gauss and Codazzi-Mainardi equations can be written as

$$R_{ijk\ell} = b_{ik} b_{j\ell} - b_{i\ell} b_{jk} \tag{H.15}$$

and

$$b_{\ell j, i} - b_{\ell i, j} = b_{jm} \Gamma^m{}_{\ell i} - b_{im} \Gamma^m{}_{\ell j}, \tag{H.16}$$

respectively. (The Christoffel symbols are those of the metric-compatible connection
$\nabla$ on $M$. Recall that the comma denotes a partial derivative.) Hint: Show
that $b_{ij} = S_{ij}$ where $S(\partial_i) = S^j{}_i \partial_j$, and indices are
raised and lowered by using the metric. Recall the results of Exercises 8.31 and
8.34.


EXERCISE H.7 Let $M = \Sigma$ be a two-dimensional surface immersed in
$N = \mathbb{R}^3$ via the parameterization

$$\sigma : (u, v) \mapsto (x(u, v), y(u, v), z(u, v)), \tag{H.17}$$

and set $\sigma_u := \partial\sigma/\partial u$ and
$\sigma_v := \partial\sigma/\partial v$. The unit normal vector to $\Sigma$ is
defined to be

$$n = \frac{\sigma_u \times \sigma_v}{\|\sigma_u \times \sigma_v\|}, \tag{H.18}$$

where “$\times$” denotes an ordinary cross product and $\|X\|$ is the ordinary
Euclidean norm of $X$.

(a) Show that the second fundamental form of $\Sigma$ can be written³

$$\mathrm{II} = J\, du^2 + 2K\, du\, dv + L\, dv^2, \tag{H.19}$$

where

$$J = -g(n_u, \sigma_u) = g(n, \sigma_{uu}), \tag{H.20}$$

$$K = -g(n_v, \sigma_u) = g(n, \sigma_{uv}) = g(n, \sigma_{vu}) = -g(n_u, \sigma_v), \tag{H.21}$$

$$L = -g(n_v, \sigma_v) = g(n, \sigma_{vv}), \tag{H.22}$$

and the subscript variables are shorthand for differentiation with respect to that
variable. For example, $\sigma_u = \partial\sigma/\partial u$ and
$\sigma_{uu} = \partial^2\sigma/\partial u^2$.

Hint: The difficult part of this problem is just translating the formal differential
geometric language into the traditional language. In other words, it is mostly
a problem of notation. The surface $\Sigma$ is the image of the map
$\sigma : D \to \mathbb{R}^3$, where $D$ is a domain in $\mathbb{R}^2$ with
coordinates $(u, v)$. The two vectors $\partial_u := \partial/\partial u$ and
$\partial_v := \partial/\partial v$ span the tangent space to $D$, so their images
$\sigma_*(\partial_u) =: X$ and $\sigma_*(\partial_v) =: Y$ span the tangent space
of $\Sigma$. Note that if $f : \Sigma \to \mathbb{R}$ then

$$Xf = \sigma_*(\partial_u) f = (\partial_u)(\sigma^* f) = f(\sigma(u, v))_u
     = \frac{\partial f}{\partial x} \frac{\partial x}{\partial u}
     + \frac{\partial f}{\partial y} \frac{\partial y}{\partial u}
     + \frac{\partial f}{\partial z} \frac{\partial z}{\partial u}
     = \Bigl( \frac{\partial f}{\partial x}, \frac{\partial f}{\partial y}, \frac{\partial f}{\partial z} \Bigr)
       \cdot \Bigl( \frac{\partial x}{\partial u}, \frac{\partial y}{\partial u}, \frac{\partial z}{\partial u} \Bigr)
     = \operatorname{grad} f \cdot \sigma_u = \sigma_u \cdot \operatorname{grad} f,$$

where the dot denotes the usual Euclidean dot product. Dropping $f$ gives

$$X = (\sigma_u)^i \partial_i.$$

Similarly, $Y = (\sigma_v)^i \partial_i$ and $\xi = n^i \partial_i$. Finally, note
that the second fundamental form appearing in this problem is really the pullback
of the second fundamental form as we have defined it, even though, as is customary,
we use the same notation for both. Thus, for example, we have

$$J = (\sigma^* \mathrm{II})(\partial_u, \partial_u) = \mathrm{II}(\sigma_*(\partial_u), \sigma_*(\partial_u)).$$

3 The traditional notation is $e$, $f$, and $g$ rather than $J$, $K$, and $L$, but we wish to reserve $g$ for the metric. However,
$K$ is usually reserved for the Gaussian curvature but in this appendix we use $K$ for a component of the second
fundamental form.


(b) Using Weingarten’s formula (H.10), show that the shape operator (H.9) can be
written

$$S = (EG - F^2)^{-1} \begin{pmatrix} GJ - FK & GK - FL \\ EK - FJ & EL - FK \end{pmatrix}, \tag{H.23}$$

where $E$, $F$, and $G$ are the coefficients of the first fundamental form (metric)
of $\Sigma$:

$$\mathrm{I} = E\, du^2 + 2F\, du\, dv + G\, dv^2. \tag{H.24}$$

Hint: You may wish to refer to Exercise 8.47.

(c) Show that the Gaussian curvature (the product of the principal curvatures) can
be written as the ratio of the determinants of the second and first fundamental
forms:

$$\lambda_1 \lambda_2 = \frac{\det \mathrm{II}}{\det \mathrm{I}} = \frac{JL - K^2}{EG - F^2}. \tag{H.25}$$


(d) The coordinates $u$ and $v$ on $\Sigma$ are called isothermal if

$$g(\sigma_u, \sigma_u) = g(\sigma_v, \sigma_v) \qquad \text{and} \qquad g(\sigma_u, \sigma_v) = 0, \tag{H.26}$$

or equivalently, if the first fundamental form of the surface can be written

$$\mathrm{I} = \lambda^2 (du^2 + dv^2) \tag{H.27}$$

for some positive function $\lambda^2 = g(\sigma_u, \sigma_u) = g(\sigma_v, \sigma_v)$.⁴
Show that if the surface admits an isothermal parameterization then

$$\sigma_{uu} + \sigma_{vv} = 2\lambda^2 H, \tag{H.28}$$

where $H = (\lambda_1 + \lambda_2)\, n / 2$ ($H$ is sometimes called the mean
curvature vector). Hint: Differentiate the isothermal equations (H.26).

4 More generally, a metric of the form $g_{ij} = \lambda^2 \delta_{ij}$ (in any dimension) is said to be conformally flat.

(e) A surface for which the mean curvature vanishes is called a minimal surface.⁵
Show that the catenoid, given parametrically by

$$\sigma(u, v) = (a \cosh v \cos u,\ a \cosh v \sin u,\ av), \qquad 0 \le u < 2\pi, \quad -\infty < v < \infty,$$

is a minimal surface. Hint: Is the parameterization isothermal?

Remark The word “isothermal” comes from the word “isotherm” and means “at
constant temperature”. If $\varphi$ is the scalar field representing the temperature
at any point then the equation governing its behavior (the heat equation) is
$\dot\varphi = \kappa^2 \nabla^2 \varphi$, where $\kappa$ is the thermal
diffusivity, the dot indicates a time derivative, and
$\nabla^2 = \partial_x^2 + \partial_y^2 + \partial_z^2$ is the ordinary Laplacian.
In the steady state the time derivative vanishes, so the temperature satisfies
$\nabla^2 \varphi = 0$; this implies that $\varphi$ is a harmonic function and that
the level surfaces of $\varphi$ are isotherms.

If we have a minimal surface, so that the mean curvature vanishes, then, if the
parameterization is isothermal (and such a parameterization always exists; see [73],
p. 31), by (H.28) each component $x_i(u, v)$ of $\sigma$ is a harmonic function in
parameter space ($(u, v)$ space). The level curves of the coordinate functions,
i.e., $x_i(u, v) = \text{constant}$, are in some sense “coordinate isotherms”,
whatever that means. But this is not the reason for the terminology.

Instead, suppose that we have a planar two-dimensional object (with constant
$\kappa$), described parametrically by $\sigma(u, v) = (x(u, v), y(u, v))$, and
suppose that heat is flowing through the object. Gabriel Lamé showed [48] that the
lines of constant $u$ (respectively, $v$) are the isotherms and that the lines of
constant $v$ (respectively, $u$) are the adiabats (lines of heat flow through the
system in the steady state) if and only if the coordinates $u$ and $v$ are
“isothermal” in the sense that $\sigma_u \cdot \sigma_u = \sigma_v \cdot \sigma_v$
and $\sigma_u \cdot \sigma_v = 0$, where $\cdot$ is the usual Euclidean inner
product.

5 The origin of this terminology comes from the Belgian physicist Joseph Plateau who, motivated by
experiments with soap films on wire frames around 1850, asked the question whether there was a unique surface of
minimal area whose boundary was some given closed curve (Plateau’s problem). A surface of minimal area
is a minimal surface, so that accounts for the interest in the subject. (Technically, a vanishing mean curvature
only implies a zero first-order variation in the surface area when it is subjected to small normal displacements
(see e.g. [22] or [73]), so the minimal surface condition does not necessarily imply minimal area although in
practice it usually does.)

EXERCISE H.8 A cylinder of radius $a$ is given the following parameterization:

$$\sigma(u, v) = (a \cos u,\ a \sin u,\ v), \qquad 0 \le u < 2\pi, \quad -\infty < v < \infty.$$

Show that the intrinsic curvature (as measured by the Gaussian curvature) is zero
but that the extrinsic curvature (as measured by, say, the mean curvature) is nonzero.
This confirms our intuition that a cylinder embedded in $\mathbb{R}^3$ is
intrinsically flat but extrinsically curved. Hint: Apply the results of Exercise
H.7. Remark: A surface can have vanishing mean curvature, yet still be curved in the
extrinsic sense, as long as at least one principal curvature is nonvanishing.
Moreover, in higher dimensions one must verify that the entire Riemann tensor is
zero in order to verify that a manifold has no intrinsic curvature.
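The formulas of Exercise H.7 lend themselves to a numerical sanity check, which can be useful before attempting the exercises by hand. The sketch below is not a solution method from the text: it approximates the derivatives of a parameterization by central finite differences (an assumption made for simplicity; the step size `h` and the sample point are arbitrary), builds the coefficients $E, F, G$ and $J, K, L$, and then evaluates the Gaussian curvature from (H.25) and the mean curvature as half the trace of (H.23). The function names are hypothetical.

```python
import math


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


def cross(a, b):
    return [a[1] * b[2] - a[2] * b[1],
            a[2] * b[0] - a[0] * b[2],
            a[0] * b[1] - a[1] * b[0]]


def curvatures(sigma, u, v, h=1e-4):
    """Return (Gaussian, mean) curvature of sigma at (u, v) by finite differences."""
    s0 = sigma(u, v)
    up, um = sigma(u + h, v), sigma(u - h, v)
    vp, vm = sigma(u, v + h), sigma(u, v - h)
    su = [(a - b) / (2 * h) for a, b in zip(up, um)]
    sv = [(a - b) / (2 * h) for a, b in zip(vp, vm)]
    suu = [(a - 2 * c + b) / h ** 2 for a, c, b in zip(up, s0, um)]
    svv = [(a - 2 * c + b) / h ** 2 for a, c, b in zip(vp, s0, vm)]
    suv = [(a - b - c + d) / (4 * h * h)
           for a, b, c, d in zip(sigma(u + h, v + h), sigma(u + h, v - h),
                                 sigma(u - h, v + h), sigma(u - h, v - h))]
    normal = cross(su, sv)
    length = math.sqrt(dot(normal, normal))
    n = [x / length for x in normal]                       # unit normal (H.18)
    E, F, G = dot(su, su), dot(su, sv), dot(sv, sv)        # first form (H.24)
    J, K, L = dot(n, suu), dot(n, suv), dot(n, svv)        # second form (H.19)
    det_first = E * G - F * F
    gauss = (J * L - K * K) / det_first                    # (H.25)
    mean = (G * J - 2 * F * K + E * L) / (2 * det_first)   # half the trace of (H.23)
    return gauss, mean


def cylinder(u, v, a=2.0):
    return [a * math.cos(u), a * math.sin(u), v]


def catenoid(u, v, a=1.5):
    return [a * math.cosh(v) * math.cos(u), a * math.cosh(v) * math.sin(u), a * v]


cyl_gauss, cyl_mean = curvatures(cylinder, 0.7, 0.3)
cat_gauss, cat_mean = curvatures(catenoid, 0.7, 0.3)
print("cylinder:", cyl_gauss, cyl_mean)
print("catenoid:", cat_gauss, cat_mean)
```

With these sample values the cylinder should show a Gaussian curvature near zero and a mean curvature near $-1/(2a)$ (the sign depends on the orientation of the normal), while the catenoid should show a mean curvature near zero, up to finite-difference error.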


EXERCISE H.9 Enneper’s surface (see the cover illustration) is a classic example 
of a minimal surface in K 3 . A standard parameterization is given by 


er(n, v ) = 



3 3 

li 2 V 2 2 

+ UV , V — — + VU , U 

3 3 



(«, v) e R 2 . 


Remark: The image on the front cover reveals only a fragment of Enneper’s surface; 
the entire surface has self-intersections. See [22], p. 205. 


(a) Show that the coefficients of the first fundamental form are 

E = G = (1 + u 2 + v 2 ) 2 , F = 0. 

(b) Show that the coefficients of the second fundamental form are 

7 = 2 , K = - 2 , L = 0 . 

(c) Show that the principal curvatures are 

2 2 
= (1 + u 2 + v 2 ) 2 and l2 = _ (l + u 2 + v 2 ) 2 ’ 

and thereby conclude that the surface is indeed minimal. Remarks: It’s easy to see 
that the surface is minimal; this is so because the parameterization is isothermal 
and the coordinate functions are harmonic. Also, Osserman has shown ([73], p. 
87) that, of all the complete regular (i.e., immersed) minimal surfaces in R 3 , only 
the catenoid and Enneper’s surface have the property that their Gauss maps are 
injective. 
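
The claims in Exercise H.9 can be checked symbolically. The following is a minimal sketch, assuming SymPy is available; the shorthand `W`, the labels `e`, `f`, `g` for the second fundamental form coefficients, and the name `curvature_sum` are choices made for this sketch and are not notation taken from the text. It uses the standard formulas: the first fundamental form comes from dot products of the tangent vectors, and the second from dot products of the second derivatives with the unit normal.

```python
import sympy as sp

u, v = sp.symbols("u v", real=True)
W = 1 + u**2 + v**2  # shorthand for this sketch; positive for all real u, v

# Enneper's surface, parameterized as in the exercise.
sigma = sp.Matrix([u - u**3 / 3 + u * v**2,
                   v - v**3 / 3 + v * u**2,
                   u**2 - v**2])
s_u, s_v = sigma.diff(u), sigma.diff(v)

# (a) First fundamental form coefficients.
E = sp.factor(s_u.dot(s_u))
F = sp.expand(s_u.dot(s_v))
G = sp.factor(s_v.dot(s_v))

# Unit normal: |s_u x s_v|^2 equals W^4, so dividing the cross product
# by W^2 normalizes it without taking a symbolic square root.
cross = s_u.cross(s_v)
assert sp.expand(cross.dot(cross) - W**4) == 0
n = cross / W**2

# (b) Second fundamental form coefficients (labels e, f, g are assumptions).
e = sp.cancel(s_u.diff(u).dot(n))
f = sp.cancel(s_u.diff(v).dot(n))
g = sp.cancel(s_v.diff(v).dot(n))

# (c) Because F = f = 0, both forms are diagonal in these coordinates,
# so the principal curvatures are simply e/E and g/G.
k1 = sp.cancel(e / E)
k2 = sp.cancel(g / G)
curvature_sum = sp.cancel(k1 + k2)  # vanishes exactly when the surface is minimal

print("E, F, G:", E, F, G)
print("e, f, g:", e, f, g)
print("k1, k2, k1 + k2:", k1, k2, curvature_sum)
```

Since the principal curvatures are equal and opposite, their sum vanishes regardless of whether the mean curvature is normalized as a sum or as an average.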


EXERCISE H.10 Given a smooth function $f(x, y)$ on $\mathbb{R}^2$, its graph $z = f(x, y)$
determines a two-dimensional surface $\Sigma$ in $\mathbb{R}^3$.


(a) Find the coefficients of the first fundamental form of $\Sigma$ in terms of $f$ and its
derivatives.

(b) Find the coefficients of the second fundamental form of $\Sigma$ in terms of $f$ and its
derivatives.

(c) Compute the Gaussian curvature of $\Sigma$.

(d) Write down a differential equation that must be satisfied by $f$ in order for the
surface to be minimal.
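
One way to check an answer to Exercise H.10 is to let a computer algebra system carry out the same steps for the graph parameterization $(x, y) \mapsto (x, y, f(x, y))$. The sketch below assumes SymPy; the function name `graph_surface_data` and the labels `e`, `m`, `g` for the second fundamental form coefficients are hypothetical names introduced here, not notation from the text. The last returned quantity is the mean curvature numerator, whose vanishing is the minimal surface condition.

```python
import sympy as sp

x, y = sp.symbols("x y", real=True)


def graph_surface_data(f):
    """Return (E, F, G), (e, m, g), K and the minimal-surface expression
    for the graph z = f(x, y), where f is a SymPy expression in x and y."""
    sigma = sp.Matrix([x, y, f])
    s_x, s_y = sigma.diff(x), sigma.diff(y)

    # First fundamental form.
    E, F, G = s_x.dot(s_x), s_x.dot(s_y), s_y.dot(s_y)

    # s_x x s_y = (-f_x, -f_y, 1), whose length is w.
    w = sp.sqrt(1 + f.diff(x)**2 + f.diff(y)**2)
    n = s_x.cross(s_y) / w

    # Second fundamental form: second derivatives dotted with the unit normal.
    e = s_x.diff(x).dot(n)
    m = s_x.diff(y).dot(n)
    g = s_y.diff(y).dot(n)

    # Gaussian curvature, and the mean curvature numerator with the
    # common factor 1/w cleared.
    K = sp.simplify((e * g - m**2) / (E * G - F**2))
    minimal_lhs = sp.expand(w * (e * G - 2 * m * F + g * E))
    return (E, F, G), (e, m, g), K, minimal_lhs


# Generic f: print the expression that must vanish for a minimal graph.
f_generic = sp.Function("f")(x, y)
print(graph_surface_data(f_generic)[3])

# Concrete check: Scherk's surface z = log(cos y) - log(cos x) is a minimal graph.
scherk = sp.log(sp.cos(y)) - sp.log(sp.cos(x))
print(graph_surface_data(scherk)[3])
```

Clearing the factor $1/w$ before expanding keeps the minimal surface condition polynomial in the derivatives of $f$, which makes it easier to compare with a hand-derived equation.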
action, see group action
active transformation, 69
adjoint map, see map, adjoint
adjoint matrix, see matrix, adjoint 
adjoint representation 
of a Lie algebra, 109 
of a Lie group, 109 
adjugate, see matrix, adjugate 
affine 

independence, 141 
connection, 189 
hyperplane, see hyperplane 
parameter, 214 
subspace, 2 
algebra 
exterior, 41 
graded, 31 
symmetric, 52 
algebraic multiplicity, 24 

alternating symbol, see Levi-Civita alternating symbol 
alternating tensor, see tensor, antisymmetric 
annihilator, 9 

anti-Hermitian matrix, see matrix, anti-Hermitian 
anticommutator, 53 
antiisomorphism, 19 
antilinear, see map, antilinear 
antipodal map, see map, antipodal 
arc length parameterized curve, 214 
atlas, 63 
automorphism 
general, 267 
linear, 3 

autoparallel, see geodesic 
axial vector, see pseudovector 

Baker-Campbell-Hausdorff formula, 110 
ball, 57 

barycentric coordinates, 141 


base space, 177 
basis, 2 

completion of, 3 
dual, 8 

of a topology, see topology, basis of 
of sections, 178 
Betti number, 145 
Bianchi identity, 184 
bijective map, see map, bijective 
bilinear form, 14 
symmetric, 14 
bosonic particle state, 53 
boundary 

of a manifold, 167
of a polytope, 140
of a simplex, 143 
of a simplicial chain, 145 
of a singular chain, 147 
boundary complex, 152 
boundary operator, 143 
boundary space 
singular, 147 

bracket-generating distribution, 294 
bridge, 302 

Brouwer fixed point theorem, 133 

bump form, 132

bundle 

cotangent, 180 
tangent, 180 
tensor product, 180 
bundle projection, 177 
bundle space, see total space 

canonical commutation relations, 53 
canonical transformation, 114 
cardinality, 264 
Cartan’s 

equations of structure, 185
lemma, 50 

Cartesian product, 265 
catenoid, 236
chain
simplicial, 143
singular, 146 
smooth singular, 158 
chain complex, 150 
chain map, 126 

change of variables theorem, 162
characteristic class, 189

characteristic polynomial, see matrix, characteristic 
polynomial of 

Chern-Gauss-Bonnet theorem, 262 
chord, 303 

Chow-Rashevskii theorem, 294 
Christoffel symbol, 202 
class representative, 6 
classification theorem, 155 
closed 
form, 116 
manifold, 155 
set, 56 
closure, 56 
cobasis, 8 
coboundary, 125 
smooth singular, 171 

coboundary operator, see connecting homomorphism 
co-closed form, 223 
cocycle, 125 

smooth singular, 170 

cocycle conditions (on a vector bundle), 178 

Codazzi-Mainardi equation, 310 

co-differential operator, 222 

cofactor, see matrix, cofactor 

coframe field, 80 

cohomologous closed forms, 123 

cohomology 

of a complex, 125 
smooth singular, 170 
cokernel, 21

column rank, see rank, of a matrix 
commutative diagram, 126 
commutator, 53 
compact set, 56 
complement, 263 

complete manifold, see manifold, complete 
complete topological invariant, 58 
completely integrable, 286 
complex 

de Rham, 126
differential, 125 
component function, 59 
components 
contracted, 12 
contravariant, 11 
covariant, 11 
of a link, 255 
vector, 2 

composition, see map, composition of 
compound matrix, see matrix, compound 
cone, 118 


conformally flat, 314 

conjugate linear map, see map, conjugate linear 
conjugation, 108 
connected component, 123 
connected sum, 154
connected space, 123 
k-connected space, 136 
connecting homomorphism, 127 
connection, 182 
flat, 209 

Levi-Civita, 200 
matrix, 183 
Riemannian, 200 
constraint, 291 
holonomic, 291 
continuous 
image, 58 

map, see map, continuous 
contractible space, 118 

contragredient matrix, see matrix, contragredient 
contravariant components, see components, 
contravariant 

convex 

combination, 139 
hull, 139 
set, 139 
coordinate 
chart, 63 

frame, see frame field, holonomic 
functions, 60 
map, 62 

neighborhood, see coordinate patch 
patch, 62 
coordinates, 60 

geodesic, see coordinates, Riemann normal 
isothermal, 314 
local inertial or Lorentz, 278 
locally flat, 278 
Riemann normal, 277 
cotangent bundle, see bundle, cotangent 
cotangent space 

as dual of tangent space, 79 
as jet space, 81 
countable set, 264 
covariant 

components, see components, covariant 
derivative, 182 
covariant derivative, 182
covector, 8 

components of, 8 

covector field, see differential form, 1-form
cover, 111 
universal, 111 
covering space, see cover 
Cramer’s rule, 22 
critical point, 71
crosspolytope, see polytope 
curl, 242
curvature
constant, 233 
Gauss-Kronecker, 312 
Gaussian, 233 
invariant, 207 
matrix, 186 
mean, 312 
operator, 186 
principal, 312 
tensor, 188 
2-forms, 184 
CW complex, 149 
cycle 

elementary, 303 
in a graph, 299 
of a simplicial complex, 145 
cycle rank, 303 
cycle space 

of a simplicial complex, 145 
singular, 147 
cyclic group, 266 

Darboux’s theorem, 113 
de Morgan’s laws, 263 
de Rham 

cohomology, 123 
first theorem, 170 
second theorem, 170 
theorem, 172 

deformation retraction, 125 
degree 

of a map, 251
dense set, 56 
derivation, 75 
derivative 
Lie, 102 

of a smooth map, 98 
of map on $\mathbb{R}^n$, 59
partial, 59 
determinant 

of a linear map, 43 
of a matrix, 17 

diagonalizable matrix, see matrix, diagonalizable 
diagonalizing a matrix, 271
diffeomorphic spaces, 69 
diffeomorphism 
of manifolds, 69 
on $\mathbb{R}^n$, 59

difference (of sets), 263 
differential 

of a complex, 125 
of a smooth map, 98 
differential form, 87 
1-form, 79 

differential ideal, 290 
dimension 

of a face of a polytope, 140
of a simplicial complex, 142 


of a vector space, 2 
direct sum, 2 

discrete topology, see topology, discrete 

distribution, 284 

divergence, 242 

division ring, see skew field 

domain, 263 

dot product, 15 

double cover, see cover, double 

double dual, 9 

dual map, see map, dual 

dual pairing, 8 

dual space, 8 

E8-space, 145 
edge 

of a graph, 296 
of a polytope, 140
eigenvalue, 24 
eigenvector, 24 

Einstein manifold, see manifold, Einstein 
Einstein summation convention, 33 
Einstein tensor, 207 
electromagnetic 2-form, 92 
elementary symmetric function, 25 
elliptic curve, 105 
embedding, 71 
endomorphism 
general, 267 
linear, 3 

Enneper’s surface, 316 

epsilon symbol, see Levi-Civita alternating symbol 
epsilon tensor, see Levi-Civita alternating tensor 
epsilon tensor identities, 240 
equivalence 
class, 264 
principle, 278 
relation, 264 
equivalence mod n, 264 
Euclidean 
distance, 57 
group, 228 
metric, 196 
space, 15 
Euler 

characteristic, 149 
class, 262 
exact form, 117 
exact sequence, 4 
split, 5 

exponential map 
on a Lie group, 107 
on a Riemannian manifold, 276 
extended by linearity, 3 
exterior 

algebra, see algebra, exterior 

derivative, 89 

power
of a map, 42
of a vector space, 39 
product, 38 

extrinsic curvature tensor, see second fundamental 
form 

face 

of a polytope, 140 
of a simplex, 141 
facet, 140 

Fermi normal coordinates, 214 
fermionic particle states, 53 
fiber, 177 
field, 269 
finite, 269 
field strength, 192 
first fundamental form, 310 
flat space, 209 
flow (of a vector field), 101 
Fock space, 52 
foliation, 289 
forest, 302 
frame field, 78 
holonomic, 194 
nonholonomic, 194 
orthonormal, 196 
Frobenius’ theorem, 286 
function linearity, 85 
functor 

contravariant, 98 
covariant, 98 
fundamental group, 136

gauge 

covariant derivative, 192 
field theory, 191 
group, 191 
potential, 191 
transformation, 191 
Gauss-Bonnet theorem, 261 
Gauss equation, 310 
Gauss’s formula, 258 

Gauss-Kronecker curvature, see curvature, 
Gauss-Kronecker 
Gauss map, see map, Gauss
Gaussian curvature, see curvature, Gaussian 
general linear group, 105 
generator of cohomology, 132 
genus, 155 
geodesic, 213 
null, 214 

geodesic deviation, 215 
geodesic deviation equation, 216 
geodesic equation, 213 
geometric 
manifold, 193 
multiplicity, 24 
geometry, 193 


graded filtration, 81 
gradient, 242 

Gram matrix, see matrix, Gram 
Gram-Schmidt orthonormalization, 28 
Grammian, 17 
graph, 296 
group, 265 

general linear, 105 
orthogonal, 105 
unitary, 108 
group action, 268 

hairy ball theorem, 253 
half-space, 140 
Hamiltonian, 114 

Hamiltonian vector field, see vector field, 
Hamiltonian 
harmonic form, 223 

Hausdorff topology, see topology, Hausdorff 
heat equation, 315 

hedgehog theorem, see hairy ball theorem 
Hermitian matrix, see matrix, Hermitian 
Hilbert space, 52 
Hodge 

decomposition theorem, 223 
dual 

of a p-vector, 45
star, see Hodge dual 
theorem on harmonic forms, 224 
Hodge-de Rham Laplacian, see Laplace-de Rham 
operator 

holonomy group, 219 
restricted, 219 
homeomorphic spaces, 58 
homeomorphism, 58 
homologous cycles, 145 
homology 
singular, 147 
smooth singular, 159 
homology group 
simplicial, 145 
homomorphism 
group, 267 
linear, 3 

of complexes, 126 
homotopic 
maps, 118 
spaces, 118 

homotopically trivial map, see map, null homotopic 
homotopy, 118 
homotopy group 

first, see fundamental group 
homotopy operator, 120
homotopy type, 118 
Hopf fibration, 254 
hyperbolic space, 233 
hyperplane, 140 
hypersurface, 310

idempotent map, 4
identification, 66 
image, 263 

imbedding, see embedding 
immersion, 70 

impedance matrix, see matrix, impedance 
incidence matrix, 297 
inclusion map, 5 
inclusion-exclusion, 154 

indefinite inner product, see inner product, indefinite 
index 

of a map, 250 
of a vector field, 260 
indices 

contravariant, 11 
covariant, 11 
raising and lowering, 197 
injective map, see map, injective 
inner product, 14 
Euclidean, 15 
indefinite, 15 
Lorentzian, 15 
negative definite, 15 
nonnegative definite, 15 
nonpositive definite, 15 
positive definite, 15
inner product space, 14 
integrable distribution, 284 
integral 

Riemann, 160 
integral curve, 100 
integral submanifold, 284 
interior 

of a polytope, 140
of a set, 56 
interior product, 93 
intersection, 263 
into map, see map, injective 
invariant subspace, 271
inverse 

function theorem, 59 
image, 263 

involutive distribution, 285 
isometric geometric manifolds, 193 
isomorphic 
groups, 267 
vector spaces, 3 
isomorphism 
linear, 3 
of groups, 267 
of Lie algebras, 107 
isospin, 190 

isothermal coordinates, see coordinates, isothermal 
isotopy, 255 

Jacobi field, 215 
Jacobi identity, 78 

Jacobi’s equation, see geodesic deviation equation 


Jacobian 

determinant, 59 
matrix, 59 
jet space, 81

join of simplicial complexes, 152 

kernel, 3 
Killing 
field, 227 
form, 110 

Killing’s equation, 227 
knot, 255 

Koszul formula, 200 
Kronecker 
delta, 8 
symbol, 240 

product, see matrix, Kronecker product 

Laplace-Beltrami operator, 242 

Laplace-de Rham operator, 223 

Laplace expansion, 22 

Laplacian, see Laplace-Beltrami operator 

Laplacian matrix, see matrix, Laplacian 

leaf, 289 

Lefschetz 

fixed point theorem, 157 
number, 157 

Lefschetz-Hopf theorem, 260 
left-invariant vector field, 106 
Legendre polynomial, 28 
Levi-Civita 

alternating symbol, 238 
alternating tensor, 240 
connection, see connection, Levi-Civita 
Lie algebra 

of a Lie group, 106 
of vector fields, 78 
Lie derivative, see derivative, Lie
Lie group, 105 
Lie subalgebra, 108 
limit point, see accumulation point 
line element, 197 
linear 

functional, 8 
independence, 2 
map, see map, linear 
link, 255 
linking number 
of links, 255 
of manifolds, 258 
Liouville’s theorem, 114 
local 

coordinates, 64 
trivialization, 177 
locally Euclidean manifold, 62 
locally finite open cover, 56 
locally flat 

coordinates, see coordinates, locally flat
manifold, 210

locally straight integral submanifold, 286 
loop in a manifold, 136 


manifold 

complete, 233 
Einstein, 238 
isotropic, 232 
Lorentzian, 194 
orientable, 64 
orientation 

by Jacobian, 64 
induced, 167 
orientation class, 275 
parallelizable, 179 
pseudo-Riemannian, 194 
Riemannian, 193 
smooth, 62 
symplectic, 113 
topological, 62 
with boundary, 167 
map, 263 
adjoint, 20 
antilinear, 14 
antipodal, 252 
bijective, 263 
composition of, 264 
conjugate linear, 19 
continuous, 57 
Gauss, 311 
idempotent, 4 
identity, 264 
inclusion, 5 
injective, 263 
inverse, 264 
linear, 3 
multilinear, 34 
null homotopic, 118 
smooth 

of Euclidean spaces, 59 
of manifolds, 69 
sphere, 311
surjective, 263 
matrix 
adjoint, 20 
adjugate, 22 
anti-Hermitian, 108 
change of basis, 10 
characteristic polynomial of, 24 
cofactor, 22 
compound, 43 
congruence of, 18
contragredient, 10 
diagonalizable, 24 
Gram, 17 
Hermitian, 271
impedance, 301 
incidence, 297 


Kronecker product, 49 
Laplacian, 303 
minor, 22 
rotation, 23 
roto-reflection, 23 
singular, 22 
transpose of, 10 
unitary, 271 
matrix group, 105 
matrix tree theorem, 303 
matter field, 191 
Maurer-Cartan form, 107 
Maxwell’s equations, 91 
Mayer-Vietoris sequence
long, 130 
short, 129 

mean curvature, see curvature, mean 
mean curvature vector, 314 
mesh current, 304 
metric, 193 
bi-invariant, 237 
covariant constancy, 226 
induced, 235 
minimal surface, 315 
Minkowski 
metric, 196 
spacetime, 15 
minor, see matrix, minor 
Möbius
band, 66 
group, 235 
strip, 66 

transformation, 235 
moment curve, 163 
moving frame, see frame field 
multigraded algebra, 33 
multi-index notation, 40 
multivalued function, 117 


natural pairing, see dual pairing 
negative definite inner product, see inner product, 
negative definite 
neighborhood, 55 
Newton’s identities, 26 
nilpotent, 89 

node, see vertex, of a graph 
nondegenerate 
dual pairing, 9 
form, 14 

nonnegative definite inner product, see inner product, 
nonnegative definite 

nonpositive definite inner product, see inner product, 
nonpositive definite 
nonsingular matrix, 22 
normal 

bundle, 308 
space, 308 
vector, 308
vector field, 308
nucleon, 190 

null homotopic map, see map, null homotopic 
null space, see kernel 
nullity, 3 

1-form, see differential form, 1-form 
one-parameter group of diffeomorphisms, see flow (of 
a vector field) 

one-parameter subgroup, 106 
one-point map, 118
one-to-one map, see map, injective 
onto map, see map, surjective 
open cover, 56 
operator 

Hermitian, 271
linear, 3 

order of a tensor, 30 
order of vanishing, 81 
ordered set, 265 

orientable manifold, see manifold, orientable 
orientation class, see manifold, orientation class 
oriented simplicial complex, see simplicial complex, 
oriented 
orthogonal 

complement, 17 
group, 23, 105 
matrix, 23 
transformation, 23 
vectors, 16 

orthonormal set of vectors, 16 

parallel transport map, 213 

parallel transport or translation, 212 

parallelizable manifold, see manifold, parallelizable 

parameterization of a manifold, 70 

parity transformation, 239 

partition of a set, 265 

partition of unity, 129 

passive transformation, 69 

path 

identity, 135 
in a graph, 299 
in a manifold, 134 
inverse, 135 
product, 135 

path-connected space, 136
path homotopy, 135 
path ordered exponential, 218 
path ordering operator, 218 
Pauli matrices, 111 
period (integral), 170 
permutation, 266 

permutation group, see symmetric group 
Pfaffian 

of a matrix, 261 
system, 289 
Plateau’s problem, 315 


Platonic solid, 152 
Poincaré

conjecture, 208 
duality, 224 
lemma, 120 
Poisson bracket, 114 
polar vector, 239 
polygon, 152 
polyhedron, 140 
polytope, 139 

crosspolytope, 152 
simplicial, 152 

positive definite inner product, see inner product, 
positive definite 

power sum symmetric function, 25 
preimage, see inverse image 
principal 
bundle, 191 

curvatures, see curvature, principal 
directions, 312 

product (of sets), see Cartesian product 
projection operator, 4 
projection, canonical, 6 
projective general linear group, 235 
proper 

distance, 214 
face, 140 
time, 214 
pseudoscalar, 239 
pseudotensor, 239 
pseudovector, 239
pullback, 95 

punctured Euclidean space, 125 
pushforward, 97 

quaternions, 269 
quotient 

of two topological spaces, see topology, quotient 
of two vector spaces, 6 

range, 263 
rank 

of a linear map, 3 
of a matrix, 8 
of a tensor, 32 
of a vector bundle, 177 
rank-nullity theorem, 5 
refinement (of a cover), 56 
regular point, 71
regular value, 71
regular value theorem, 72 
representation 
of a group, 108 
of a Lie algebra, 109 
of a linear map, 7 
retraction, 125 
Ricci 

curvature scalar, 207
flow, 208
identity, 226 
tensor, 206 

Riemann curvature tensor, 204 
Riesz’s lemma, 19, 29 
ring, 268 

rotation matrix, see matrix, rotation 
roto-reflection matrix, see matrix, roto-reflection 
row rank, see rank, of a matrix 

scalar, 1, 268 
Schläfli symbol, 153
Schur’s theorem, 233 
Schwarzschild line element, 229 
second 

fundamental form, 310 
fundamental tensor, 310 
section 

of a bundle, 178 
of linear map, 5 
sectional curvature, 232 
self-adjoint matrix, see matrix, Hermitian 
semicolon notation, 225 
semisimple Lie algebra, 110 
sesquilinear form, 14 
set, 263 

shape operator, 311 
short exact sequence 
of complexes, 127 
of vector spaces, 4 
sign (of a permutation), 266 
signature, 18 
similarity 
invariant, 24 
transformation, 24 
simplex 

geometric, 141 
open, 141 
oriented, 142 
singular, 146 
smooth singular, 158 
standard, 159 
simplicial complex, 142 
oriented, 143 

simplicial polytope, see polytope, simplicial 
simply connected topological space, 208 
singular matrix, see matrix, singular 
singularity, see zero (of a vector field) 
skew field, 269 

skew symmetric tensor, see tensor, antisymmetric 
skew-Hermitian matrix, see matrix, anti-Hermitian 
smooth function 
on $\mathbb{R}^n$, 59
on manifold, 69 
smooth map 

of Euclidean spaces, see map, smooth 
of manifolds, see map, smooth 
space, see topological, space 


spanning set of vectors, 2 
spanning tree, see tree, spanning 
spectral theorem, 271
sphere map, see map, sphere 
spherical harmonic, 247 
spinor, 111 

stack of records theorem, 249 
standard basis, 2 

standard simplex, see simplex, standard 
stereographic projection, 64 
Stokes’ theorem 
chain version, 164
manifold version, 167 
stress energy tensor, 207 
structure 

constants, 111
functions, 202 
submanifold, 70 
submersion, 71
subset, 263 
subspace 

of a vector space, 2 
topological, 58 
support 

of a form, 164 
of a function, 129 
supporting hyperplane, 140 
surjective map, see map, surjective 
suspension, 119 
suspension isomorphism, 138 
Sylvester-Franke theorem, 50 
Sylvester’s law of inertia, 18 
symmetric 

algebra, see algebra, symmetric 
group, 266 
symplectic 
form, 113 
map, 114 

symplectomorphism, 114 
tangent 

bundle, see bundle, tangent 
space, 75 
vector, 75 

Tellegen’s theorem, 301 
tensor, 30 

antisymmetric, 37 
inertia, 32 
of type (r, s), 33
order vs. rank, 32 
symmetric, 37 
tensor algebra, 31
tensor density, 239 
tensor field, 84 
tensor product, 30 
of linear maps, 49 
symmetric, 51 

tensor product bundle, see bundle, tensor product
theorema egregium, 312
topological 
space, 55 

topological invariant, 58 
topology 
basis of, 57 
coarser, 58 
discrete, 55 
Euclidean, 57 
finer, 58 
Hausdorff, 56 
induced, 58 
on a set, 55 
product, 57 
quotient, 58 
standard 
on R, 57 
on $\mathbb{R}^n$, 57

subspace, see topology, induced 
torsion 
form, 201 

of a connection, 199 
of an abelian group, 145 
tensor, 199 
total space, 177 
trace 

of a matrix, 24 
of an endomorphism, 26 
transition function 
of a bundle, 178 
of a manifold, 63 
translate (subspace), 2 
transpose, see matrix, transpose of 
transposition, 266 
tree, 302 
number, 303 
spanning, 302 

triangulable topological space, 145 
triangulation 
continuous, 144 
smooth, 159 
trivial bundle, 178 


uncountable set, 264 
union, 263 
unit vector, 16
unitary 
group, 108 

matrix, see matrix, unitary 
universal coefficient theorem, 172 
universal cover, see cover, universal 
unknot, 255 
unlink, 255 

valuation, 154 
vector, 1 
p-vector, 39
vector bundle, 176 
vector field, 77 
Hamiltonian, 114 
vector potential, 122 
vector space, 270 
vertex 

of a graph, 296 
of a polytope, 140
volume form 
in general, 274 
on a geometric manifold, 221

wedge 

product, see exterior product 
sum, 138 

Weingarten map, see shape operator 
Weitzenböck formula, 245
Whitney 

approximation theorem, 120 
embedding theorem, 71
sum, 189 

winding number, 252 

Yang-Mills equations, 192 

zero of a vector field, 259 
zero set, 289 


